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Introduction 



The so-called boundary element methods BEM, i.e. finite element approxima- 
tions of boundary integral equations have been improved recently even more 
vividly then ever before and found some remarkable support by the German 
Research Foundation DFG in the just finished Priority Research Program 
“boundary element methods” . 

When this program began, we could start fi:om several already existing 
particular activities which then during the six years initiated many new re- 
sults and decisive new developments in theory and algorithms. The program 
was started due to encouragement by E. Stein, when most of the later par- 
ticipants met in Stuttgart at a Boundary Element Conference 1987. Then 
W. Hackbusch, G. Kuhn, S. Wagner and W. Wendland were entrusted with 
writing the proposal which was 1988 presented at the German Research Foun- 
dation and started in 1989 with 14 projects at 11 different universities. After 
German unification, the program was heavily extended by six more projects, 
four of which located in Eastern Germany. 

When we started, we were longing for the following goals: 

1. Mathematicians and engineers should do joint research. 

2. Methods and computational algorithms should be streamlined with re- 
spect to the new computer architectures of vector and parallel computers. 

3. The asymptotic error analysis of boundary element methods should be 
further developed. 

4. Non-linear material laws should be taken care of by boundary element 
methods for crack-mechanics. 

5. The coupling of finite boundary elements should be improved. 

6. The traditional aerodynamical panel methods should be extended to 
transonic flow computations. 

7. Boundary element methods should be used for solving free boundary 
value problems. 

8. Adaptivity criterions and error estimators should be found and adaptive 
methods should be created. 

9. Boundary element methods for transient and general time-dependent 
problems should be developed. 

During the six years of support, several of these key-issues could suc- 
cessfully be accomplished and, clearly, many new ideas, techniques and open 
problems arose. Particular enhancement is due to the unification of Germany 
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since several excellent research teams from East Germany contributed signifi- 
cantly to new boundary element analyses and algorithms. Also the exploita- 
tion of the new generation of computers, in particular, of parallel machines 
has triggered an overwhelming improvement of BEM as well as of their ap- 
plicability. In spite of the enormous progress, there is much more potential in 
the BEM for future analysis and simulation of complex problems in physics 
and engineering. Looking at the applications in solid mechanics, one can say 
that here the BEM have left the experimental state; the second generation of 
BEM software for engineering applications was produced. On the other hand, 
we can now see lots of new questions and problems; and also the number of 
fields for the application of BEM has increased significantly. 

Research of the following specific topics was supported: 

1. Efficient methods for three-dimensional acoustics. (Antes in Brunswick 
(Braunschweig)) 

2. BEM-FEM discretizations of solids with generalized visco-elastic material 
laws. (Gaul and Zastrau in Hamburg, Stuttgart and Wuppertal) 

3. Extension of cluster-techniques and numerical quadrature. (Hackbusch 
in Kiel) 

4. Hypersingular integrals and adaptive methods for the Kirchhoff-plate. 
(Hartmann in Kassel) 

5. Bimetal problems. (Jentsch in Chemnitz) 

6. The treatment of three-dimensional elasto-plastic notch- and crack- 
problems with boundary elements. (Kuhn in Erlangen) 

7. Adaptive domain decomposition with finite and boundary elements. 
(Langer in Chemnitz and Linz/ Austria) 

8. Boundary element methods for nonlinear surface waves and the coupling 
of rigid elastic structures; with and without viscosity. (Mahrenholtz and 
Schlegel in Hamburg) 

9. Integral equations for instationary flows around profiles. (Meister in 
Darmstadt) 

10. Approximation methods for one and multi- dimensional pseudodifferen- 
tial and Mellin equations/ Algorithms for the wavelet approximation of 
integral equations. (Profidorf in Berlin) 

11. The computation of three-dimensional singularities in elastic structures. 
(Schnack in Karlsruhe) 

12. The coupling of FEM and BEM for three-dimensional elastic structures. 
(Schnack in Karlsruhe) 

13. The numerical treatment of problems in mechanics on heterogeneous 
computer architectures. (Schnack in Karlsruhe) 

14. Collocation methods for singular equations with conjugation/ Stability of 
approximation methods for pseudo- differential operators. (Silbermann in 
Chemnitz) 

15. The continuous coupling of BEM and FEM in the Trefftz sense/FEM- 
BEM coupling in view of parallelization. (Stein in Hanover (Hannover)) 
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16. Error analysis for BEM. (Stephan in Hanover (Hannover)) 

17. The extension of BEM to transonic flow problems by coupling with field 
methods. (Wagner in Munich (Miinchen) / Stuttgart) 

18. Boundary elements of higher order. (Wendland in Stuttgart) 

19. Adaptive BEM. (Wendland in Stuttgart) 

20. Parallel computing and domain decomposition. (Wendland in Stuttgart) 

21. The coupling of nonlinear FEM with BEM in foundation structural en- 
gineering. (Werner in Munich (Miinchen)) 

Based on the original goals, the following main topics came up in our 

priority research group: 

- The coupling of FEM and BEM which initiated several new developments: 

— Traditional coupling by using finite elements in domains of nonlinear 
phenomena with applications to hyperelastic and elasto-plastic stress 
analyses and the extension of boundary element methods to contact 
problems by using variational inequalities. 

— The coupling of field methods in the form of non-linear Newton poten- 
tials with applications to elasto-plastic notch- and crack-problems and 
also to transonic flows by using generalized panel-field methods. 

— Coupling of far fields in form of nonlocal boundary conditions on ficti- 
cious boundaries by using appropriate boundary integral equations for 
the approximation of generalized Dirichlet-Neumann maps with appli- 
cations in acoustics and the computation of stationary transonic flows 
around profiles. 

— The modelling of hybrid Trefftz FEM by using local boundary element 
formulations and the application to problems in solid mechanics involv- 
ing complex geometries as in notch- and crack-problems. 

- The domain decomposition methods are closely intertwined with coupling 

methods which arose from the previous substructural techniques. 

— One variant of the domain decomposition methods are the hybrid Trefftz 
models, the so called macro-elements. Here the decisive improvement 
is based on the weak formulation of the continuity properties on the 
interfaces and the introduction of mortar-elements in combination with 
the use of the Neumann series for the realization of the local Poincare- 
Steklov operators. 

— Domain decompositions are particularly well suited for parallel algo- 
rithms which can also be formulated as multigrid and multilevel methods. 
Parallel algorithms used to be the main aim in several of the projects and 
became more and more useful for the numerical solution of the discrete 
equations also in several more applied projects. These algorithms cre- 
ated a new quality of BEM and brought about new opportunities for the 
simulation of much more complicated complex engineering problems. In 
order to use CG or GMRES iteration for the large systems of equations 
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efficiently, one needs appropriate and effective preconditionings which 
have been developed also in several of the projects. 

- In traditional BEM, the computational effort for generating the influence 
matrix, needed about 90% of the overall computing time since numerical 
integration of the singular and almost singular integrals was performed by 
non-appropriate standard numerical integration. Thanks to the close col- 
laboration between engineers and mathematicians, we have the impression 
that this problem can now pretty well be controlled: 

— The systematic collection of panels to clusters generalizes classical multi- 
pole expansions and yields fast matrix times vector operations. Here, for 
the regular integrals, one only needs integration of polynomials on the fi- 
nite elements on the boundary. The combination of clustering with multi- 
grid methods provided very efficient solvers for rather complex boundary 
element applications. In several of the engineering projects the cluster 
methods are meanwhile used as solution algorithms. 

— For the singular, hypersingular and weakly singular integrals, special nu- 
merical integration procedures and quadrature formulae were set up with 
only a few integration knots, in combination with local regularizations. 
Appropriate coordinate transformations together with the parity proper- 
ties of the kernels lead to powerful integration routines which now allow 
the efficient implementation of Galerkin methods which for given accu- 
racy became as efficient as traditional collocation methods but which 
are much more robust. For the almost singular integrals, adaptive bi- 
section methods have been created for the numerical integration where 
the asymptotic order of the error and relative accuracy are controlled 
automatically. 

— For the evaluation of the boundary potentials near to the boundary, 
new efficient asymptotic methods have been developed, first for smooth 
boundaries, which seem to be extendible also to boundaries with cor- 
ners and edges. These asymptotic methods are closely related to new 
algorithms for the super convergent postprocessing and computation of 
arbitrary derivatives of the solution, for example of the complete stress- 
field, whose realization only needs additional solution of the discrete 
equations involving only the original influence matrix. 

— The extension of efficient BEM algorithms for the treatment of nonlinear 
problems becomes more and more important. 

— For nonlinear problems of solid mechanics now exist boundary element 
methods for visco-elastic and hyperelastic materials and also for contact 
problems and three-dimensional crack propagation. 

— The panel methods of fluid mechanics have been extended to transonic 
flows. 

— The free boundary value problems of fluid mechanics can now be treated 
with BEM. 
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- Coupling methods for the combination of different physical models, e.g. 
exterior problems with artificial boundary conditions, finite elements in 
the plastic zones and boundary elements in the remaining domain as well 
as finite volumes or finite differences for vorticity densities and bound- 
ary elements for velocities become more and more important for future 
applications of BEM multifield problems. 

The adaptive methods for BEM have been developed in two rather differ- 
ent directions. Both concepts require appropriate indicators. Because of the 
nonlocality and different, possibly negative orders of the boundary integral 
operators as pseudodifferential operators it turned out that finding appro- 
priate and reliable error estimators and adaptivity criteria is a nontrivial 
and not yet completed task for boundary element analysis. However, in our 
priority research groups we could accomplish several decisive results. 

— For unstructured grids, error indicators, grid adaption and adaption in 
the cluster technique could be based on a posteriori estimates and the 
strategy by C. Johnson. 

- For hp-methods, decisive accomplishments could be obtained with John- 
son’s strategy. 

- For h-methods, asymptotic local estimators and indicators of Babuska- 
Rheinboldt type were created. 

— On structured grids, spectral methods on one hand and wavelets on the 
other hand have led us to methods where adaptivity is efficiently used 
as data-compression. 

The approximation of boundary integral equations with wavelets became 
particularly important for some of the mathematicians involved in the pro- 
gram, who have developed the analysis of these methods far beyond our 
original expectations. These results clearly became one of the outstanding 
accomplishments and they will surely generate new BEM of a yet unknown 
quality. 

Of course, one of the main topics of the program was error and stability 
analysis of BEM associated with the whole variety of discretizations as 
collocation, Galerkin methods, qualocation and the combination and fully 
discrete realization of these discretizations. 

— For most of the algorithms, grid refinements with h-methods were used, 
in particular adaptive methods on unstructured grids, the current variant 
of domain decomposition methods and efficient solution procedures of 
the discrete equations like cluster and multigrid methods. Also wavelets 
belong to the class of h-methods which converge of algebraic order if 
h tends to zero. During the six years the activities moved more and 
more from the simpler two-dimensional boundary value problems with 
one-dimensional boundary integral equations to the three-dimensional 
problems of boundary integral equations on two-dimensional surfaces. 
For special equations and regularly graded meshes on special boundary 
surfaces such as plane screens, also for the collocation methods one has 
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obtained many new results concerning stability and convergence. How- 
ever, the breakthrough for the analysis of collocation methods on general 
triangulations was still not possible. 

- However, desicive new results could be found for Galerkin procedures 
including wavelet analysis. Here, the classification and analysis of the 
operators involved including algebraic methods play a central role. Cor- 
respondingly, new error indicators and error estimators could be created. 

- In the contrary to the h-methods, spectral methods and h-p-methods 
converge exponentially. For the h-p-methods, pioneering work was done 
within the program and one can be sure that together with the avail- 
ability of fundamental boundary element software these h-p-methods are 
going to be very important in the future due to their high efficiency. 

- Clearly, a reliable error analysis is always based on fine results in solvabil- 
ity and regularity theory of the basic boundary integral equations which 
partly were also investigated within the Priority Research Program. The 
asymptotic behavior and expansion near to corners, edges and collision 
points of different boundary conditions belong to this theoretical back- 
ground and new results have been obtained in weighted spaces applied 
to anisotropic elasticity including assembled structures with different 
materials. 

- BEM for transient time depending problems based on retarded potentials 
or on time dependent boundary elements, for example with the Faedo- 
Galerkin methods, were treated in some of the projects. Although the 
problems and concepts for time dependent problems became much clearer, 
in this field still a lot of work needs to be done. Here, integral transforms 
for the the transition from the time to the frequency domain and back- 
wards play a more and more important role. Examples are the Fourier and 
Laplace transformation as well as wavelets. The numerical realization of 
these applications are demanding and rather difficult; hence the further de- 
velopment of corresponding algorithms will be one of the most important 
tasks in the near future. More recently, also mixed approximation meth- 
ods have been used for the discretization as integral transformations and 
boundary elements for time dependent problems and mixed Fourier and 
spline trial functions for rotational symmetric problems. Within the re- 
search program, particularly engineering groups have accomplished many 
results for modelling and algorithms of time depending problems. 

- An important duty within the program was the exchange, nursing and 
availability of BEM software generated by the program. W. Hackbusch and 
his staff are now developing a boundary element package which should be 
available pretty soon. Also benchmark problems have been defined serving 
for the comparison of the different boundary element programs. 

It should be mentioned that the program owes many great ideas and 
contributions to the numerous guests of the groups involved. 
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At the end of the program our referees drew rather positive conclusions. 
In particular, the cooperation between engineers and mathematicians was 
remarkably good which was very helpful for the transfer of knowledge between 
these two groups. At the end of six years work we can say that there is now 
a rigorous and innovative basis for the BEM. Nevertheless, there is a great 
need for the transfer of this fundamental knowledge to corresponding software 
tools usable by the industry. However, there is no institution to carry through 
this duty which might be a particular problem in Germany. In addition, we 
all are very worried about the current tendency to close down whole areas of 
industrial research. 

During the six years period we organized annually one workshop for the 
whole group and, in addition, several particular colloquia and meetings on 
specific topics. 

Herewith all the participants want to express their sincere gratitude to 
the German Research Foundation and, in particular, to all the people and 
the referees who helped to make this marvelous six years program happen. 

In this volume we publish most of the final reports of this program which 
will serve as information basis on corresponding literature and on the research 
groups which participated. I am very grateful to the Springer Verlag for its 
patience and the opportunity to publish this book. Sincere thanks are due to 
H. Schulz and all the others who helped preparing the manuscript. 



Stuttgart, November 1996 



Wolfgang L. Wendland 
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1 Introduction 



A wide class of problems related to wave propagation phenomena in acoustics 
can be treated by applying the linear theory. The governing equations can 
be derived by combining the hydrodynamic equations of motion and the 
local mass balance, the continuity equation. In order to obtain a linearized 
form of the resulting equations, all the physical quantities and their spatial 
derivatives are considered to be small variations of their undisturbed position. 
Furthermore, the fluid should be in rest, i.e., there shouldn’t be any steady 
flow. Assuming the medium to be an ideal gas and the constitutional change of 
the fluid at the wave front to be adiabatic leads to a relationship between the 
pressure and the density of the fluid. Introducing the latter to the linearized 
forms of the principles mentioned above finally yields the so called wave- 
equation 



div gradp 



15 ^ ... d 



( 1 . 1 ) 



where p denotes the hydrodynamic pressure and c the sonic speed. The non- 
homogeneity on the right hand side depends on the body forces 6, on the 
source density a, and on the density q of the fluid in the original state. Basi- 
cally, there are two approaches for solving such kind of equation: a) solving 
the wave equation directly in the time domain by a time stepping scheme or 
b) after a transformation into frequency domain by solving the Helmholtz 
equation. Especially, when dealing with time harmonic excitations, the for- 
mulation in frequency domain is well suited. This formulation can be found 
by separating the unknowns, where the time dependency is described by the 
function p{x,t) := p{x)e~^^^. Herein, p{x) denotes the complex amplitude, 
Lo the circular frequency and i the imaginary unity i := y/^. Introducing 
this to equation (I. I) yields the Helmholtz equation mentioned above: 



divgradp p — / , 



( 1 . 2 ) 



where k denotes the wave number, k ujc. The paper deals with both, 
the direct approach in time domain as well as the solution process in the 
frequency domain. 
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2 Acoustic problems in frequency domain 



For a wide class of problems arising in acoustics, domains extending to infinity 
are involved. For such kind of problems equation (1.2) is known to yield a 
unique solution provided the boundary conditions: 






<l{x):=~{x) =g{x), 

an r 



r = Tp u r, , Tp n r, = 0 



and special decay conditions, the Sommerfeld radiation conditions: 




are satisfied. The latter prevents outgoing waves from traveling back due to 
reflections at infinity. 

Boundary element methods have turned out to be a very sophisticated tool 
for solving problems of this class. Especially one feature makes them superior 
to other numerical methods: The satisfaction of the Sommerfeld radiation 
condition, i.e. energy radiation is contained in a natural way. Boundary in- 
tegral formulation can be obtained in various ways, for example by applying 
Green’s representation theorem. This leads to the so called direct method 
which reads as follows: 









j (t>{x,y) f(y) dOy 
n 



' 0 
< \p{x) 

, P{‘^) 



for a: G \ i? 
for X G r 
for X £ Q . 



( 2 . 1 ) 



Herein, (j){x^y) denotes the integral kernel which represents the fundamental 
solution for the present case. The formulation (2.1) refers to the exterior 
problem. For points located on the boundary this relation is only valid for 
sufficiently smooth boundaries. With the exception of the integral over the 
known perturbation function /(y), all integrals refer to the boundary only. 
This means that for the numerical approach nothing but the boundary has 
to be discretized. 

Nevertheless, boundary element methods contain some remarkable draw- 
backs. Although only the boundary of the domain under consideration has to 
be discretized, the amount of required storage increases with 0{h^) when h 
denotes the discretization parameter. Compared to the finite element method 
where this quantity increases merely with 0{h^), the boundary element 
method admits only smaller problems to deal with. Especially in frequency 
domain where the mesh parameter h has to be adapted reciprocally pro- 
portional to the frequency, realistic problems can be handled only in a very 
restricted manner. 




Efficient Calculation of Acoustic Fields by Boundary Element Method 



11 



Another critical point in boundary element calculations is the extensive 
consumption of computing time. This is a result a) of the complicated inte- 
gration involved and b) of the linear system of equations to be solved whose 
coefficient matrices are normally fully populated and non-symmetric. For a 
system discretized by the collocation method, figure 2.1 clearly demonstrates 
the dependence of computing time for the integration and for the solution 
of the algebraic system from the number of nodes. This typical example 
shows which actions have to be taken in order to increase the efficiency of 
the method: a fast solver should be employed to reduce the CPU-time from 
0{h^) to a significant lower order. 




3 Multigrid solver 

Substituting the direct solver by an iterative one appears to be convenient 
way to attain such a reduction. In most cases, iterative solvers base on matrix- 
vector multiplications with a numerical effort of 0{h‘^) per iteration sweep, 
provided that the matrices are fully populated. Clearly, the total numeri- 
cal effort depends on the number of iterations required to achieve a desired 
accuracy. 

Linear iterative methods for a system Ax = b can be written in various 
forms. Among these the first (I) and second (II) normal form: 

= (/ - + Sb = M x^''^ + Sb (I) 

(3.1) 

^(u+i) ^ ^ + Sr (II) 

play an important role. Note that the type and the behavior of the iterative 
scheme is completely defined by the choice of S which turns out to be a 
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smoothing operation later. In regard of the convergence analysis the iteration 
operator M has to be studied, i.e., its spectral radius g{M) has to be less 
than one, in order to have a convergent method. 

Most iterative solvers show a specific behavior: they are well suited to 
reduce the short wave part of the error while the reduction of the long wave 
error is rather poor. One possibility to get rid of this drawback was proposed 
by Brandt and Hackbusch in the early 70th. The idea is the combination 
of iterative and direct solvers, the iterative one to reduce the short-wave error, 
the direct one to correct the long- wave part. This procedure is known as the 
multigrid method. 

The following sequence builds the most important multigrid scheme called 
coarse grid correction. 



Coarse grid correction scheme 

(a) smoothing of the short-wave error on the fine grid by means of an iterative 
method: 

xM = xM + S(b - AxM) 

(b) calculation of the residuum on the fine grid: 

= b - 



(c) restriction, that means the transformation of the residuum onto the 
coarse grid with the restriction operator R/: 

r<-) = 

(d) solving the system to obtain e^^^ on the coarse grid by means of a direct 
method: 

Ace(") = r(*^) 

(e) prolongation: Interpolation of the error from the coarse grid back to 
the fine grid with the prolongation operator: P/: 



= P/eM 

(f) Correction of the approximation vector on the fine grid: 

x(-+i) = i(-) + 

This process just describes the two- grid method. Replacing the direct solution 
on the coarse grid in point (d) by a further application of this scheme on the 
next coarser grid leads to the real multigrid method. Here, different types 
of cycles can be defined, especially the V-cycle by applying the coarse grid 
correction once on each grid, or the W-cycle where the coarse grid correction 
is performed twice on each grid. 




Efficient Calculation of Acoustic Fields by Boundary Element Method 13 

One of the most subtle points in multigrid analysis is the smoothing op- 
eration. It should be pointed out that the iterative process used as relaxation 
does not have to converge, but sufficiently reduce the short wave error. This is 
absolutely necessary because the residuum must be as smooth as it is required 
for the approximation on the coarse grid. 

For most radiation and scattering problems in acoustics equation (2.1) 
leads to an integral equation of the second kind. Its discretized form can be 
described by the following matrix equation: 

(AI — K)p = Lq -f- s or, more general: Ax = b. (3.2) 

Now, when such an integral equation is considered, with A — AI-K,A = | 
and the PiCARD-iteration (S = I) acting as smoother, the residuum on the 
fine grid reads: 

rW = 2K{b-(il-K)xW}. (3.3) 

This equation makes it evident that the smoothness of the residuum 
depends directly on the smoothing property of the integral operator K. Now, 
a comparison of equation (3.2) and (2.1) yields that K is made up by the 
double-layer potential. Therefore, K has such property only if the boundary 
is sufficiently smooth. Edges and corners destroy this property. 

However, the smoothing effect can be regained if a suitable splitting for 
K can be established. Such kind of splitting was proposed for example by 
Amini [1]. 



3.1 Operator splitting 

A suitable splitting for K means that K is devided into one part that contains 
the smoothing property and a second one, whose norm is less than one: 

~ . ii^ii < p < 1 contracting 

K — K K with I _ (3.4) 

[ K smoothing . 

This splitting leads to a modification of the smoothing process according to 
the following form: 

{XI-K)x = b {\I-K)x = Kx + b 

(3.5) 

^.(t^+i) ^ + \-^b = SKx^‘''> + Sb. 

Here, A“^ is replaced by S and K hj K, the smoothing p^t of K. Now K 
is smoothing by definition, and due to the condition that K is contracting, 
S', the inverse of {XI — K) exists such that the smoothing character of this 
modified iterative process is guaranteed. 
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For this operator splitting, the convergence considerations given by Hack- 
BUSCH [2] have to be modified. To do this, the whole multigrid scheme is 
considered as an iterative process written in terms of the first normal form. 
Assuming the smoothing iteration to be carried out only once the iteration 
matrix for a coarse grid correction step over two grids reads as follows: 



Mf 



[l-Pj (AI-Ki_i)-iRi(AI-Kj) 



Sik,. 



(3.6) 



Since the multigrid method is represented by a recursive application of the 
two grid method, the formulation for the multigrid iteration matrix is also a 
recursive one. For the V- and W-cycle, the method can be described by the 
the relations: 



= Mf + P, Mfi, (AI - Ki_i)-iR, (AI - Kj) Si Kj (V) 

(3-7) 

Mf^ = Mf + Pi - K,_i)-'Rj (AI - Ki) Si K, (W). 



As the spectral radius is bounded by some suitable norms, the convergence 
behavior of the two- and multigrid cycle is determined by the norm of the 
iteration matrix. For the two-grid method it can be derived from equation 
(3.6). After some algebraic manipulations (see Trondle [3] ) one obtains: 



M" 



< 



I-P/R/ -f 



+ ||Pj|| ||(AI-K;_i)-i|| ||(R,K, -Kj_iR0l|l ||S,|1 ||K,| . 



(3.8) 



This relation shows clearly which conditions have to be satisfied in order to 
obtain a convergent two-grid method. In detail, these are: 

— convergence of P/ R/ towards the identity: ||l — P/ R/|| < Cpr /if 

- P/ has to be bounded, i.e., ||P/|| < Cp, 

- the stability of the coarse grid matrix: ||(AI — K/_i)“^|| < 

— the relative consistency: ||(R/ — K/_i R/)|| < Ck /if, 

— the regularity of the smooth part of the integral operator: ||K/|| < Cr, 

- the existence of a bounded inverse for (AI - K^): ||S/|| < Cs- 

With the constants introduced above, the norm of the two-grid iteration 
matrix can be bounded by: 

||Mf|| < {Cpr + CpCsCk)CsCrh^ = Czhf. (3.9) 

This means that the convergence rate is improved the more the grid param- 
eter decreases. 

Similar considerations lead to the norm of the iteration matrix for the 
multigrid W-cycle which, of course, is defined recursively: 
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IlMril < ||Mf|| + |lPz|||l(M,^Ol|x 

X [||Rj|l 1|I|| - ||(AI - 11 ll(Ri Ki - K(_i R,) 



llSillllKjll (3.10) 



< Czhf + Cp\\(Mfi,f\\ {Cs Ck /if + CnCi) Cs C. . 



Here, Ci denotes the estimation for ||I||xi^yi (the norm of the inclusion when 
K is a mapping K : Xi Yi): ||I||xzf-yi < Ci. The assumption of the above 
considerations was that S the inverse of (AI — K/) is computed exactly. In 
practice, this would mean that at least an exact decomposition has to be 
performed, and for each iteration sweep the full algebraic system 

(AI - ki) x'*') = ki - b, (3.11) 



has to be solved by a backward and forward substitution, respectively. The 
numerical effort for these operation would result in a draistic reduction of 
the multigrid efficiency. Hence, the task is to find a suitable matrix K/ that 
can be decomposed with less numerical effort. One possible way to achieve 
this is to take into account only those nodes lying close to each collocation 
point under consideration. Thereby, the so-called near-field is defined which, 
for its part, defines a special kind of sparsity pattern for K/, i.e., besides the 
main diagonal each row contains only a few entries. Other types of sparsity 
patterns are introduced in [4]. No matter what kind of sparsity pattern is 
used, the size of the decomposition of (AI — K/) can be nearly as large as 
the fully populated system matrix in the worst case. Hence, this would also 
increase the storage requirement of the method to a high degree. 

In the following, we consider two ways as a practical solution of these 
problems: 

- a restriction of K/ to the diagonal of K/ 

— the calculation of the inverse, resp. the decomposition is performed in an 
approximative sense only. 

If the near-field is restricted to the diagonal of K/, the inverse can be obtained 
very easily. In this case, when D/ denotes the diagonal of the system matrix 
A;, {Xl^Ki) = Bi holds and the iteration scheme reads as follows: 

x^ = Df'K,x["^ + Dj-'b. (3.12) 

Converting this scheme to the second normal form in (3.1) leads to 

xj"' = (bj - A/xf) (3.13) 

which shows that this equation means nothing but the JACOBi-iteration 
scheme. 

The second method under consideration, in the following called “near- 
field ILU smoother”, differs from the above described way by the effect, that 
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the decomposition of the near-field matrix {XI — Ki) is performed only incom- 
pletely (indicated by the tilde): (AI - Ki) « (LU)/. With this the smoothing 
iterations reads: 

= (LU)fi -bj) . (3.14) 

The representation in the second normal form 

(bj-A/xj")) . (3.15) 

turns out to be more convenient with regards to the implementation in a 
computer-code. 

In order to demonstrate the efficiency of this type of smoother, we con- 
sider an example with a model which contains many edges and corners (Fig. 
3.1). This study is based on a two-grid method, and the criterion for the 
comparison is given by the residuum r relative to the right hand side of the 
system of equations: r* = |j^. It is computed from time to time after 5 it- 
eration sweeps and, therefore, represents some kind of truncation error. This 
truncation error is plotted versus the excitation frequency, given in figure 
3.1 by the HELMHOLTZ-number na. The diagram demonstrates clearly that 
the convergence behavior of the two-grid method is much improved when the 
ILU-near-field smoother or the jACOBi-smoother is used. Compared to these 
the two-grid convergence using PiCARD-iteration which doesn’t contain any 
operator splitting is rather poor. Moreover, for higher excitation frequencies 
the smoothing properties of the ILU-smoother are superior to the others, 
which leads to an improved convergence of the method. 




r 

1,0 1,5 2,0 

Helmholtz-number Ka 



Fig. 3.1. Convergence of multigrid schemes with different smoothers 
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4 Adaptive mesh refinement 

In the multigrid scheme considered above all the grids on finer levels are 
constructed by halving the grid parameter. This means that a uniform re- 
finement has been performed. It’s easy to imagine that such a procedure is 
not very economic because certainly there will be areas that are discretized 
too fine. A procedure where the error is distributed continuously over the 
whole grid would be much better. This can be achieved by refining those 
areas that contribute a large amount to the error. Of course, there are some 
instruments required to detect those regions. This is done by so-called error 
indicators. 

Such an adaptive process is well suited to be combined with the multigrid 
method. In order to do this, a so-called nested iteration is modified such that 
each single grid is generated adaptively. Figure 4.1 may illustrate this process, 
which is referred to as an adaptive multigrid method. The process is started 
by computing the solution on the coarse grid, which is required to calculate 
the error indicator afterwards. Subsequently to this, the construction of the 
new grid controlled by the error indicator can take place as well as the the 
generation of the new system matrix. With a following prolongation step 
the coarse grid solution is transformed onto the adaptively refined grid, so 
to speak as a first approximation. This approximation is improved later by 
performing one or more V-cycles. In most cases only one single cycle is needed 
because the solution on the intermediate grids is just necessary to compute 
the error indicator and, hence, may not require the full accuracy. Now the 
process described above is extended to further grids until the mesh is fine 
enough. At this moment, a coarse grid correction can be applied repeatedly 
until the desired accuracy for the fine grid solution is reached. 

The most crucial point within this refining procedure is the calculation of the 
error indicator. 




Fig. 4.1. Adaptive multigrid scheme 
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4.1 Error indicator 



An error estimator can basically be written in the form 

ciV < l|e|| < C2T), (4.1) 

where c\ and C 2 are real constants, i.e., they are independent from the solution 
itself. Now, the task is to find a quantity satisfying these conditions. In [5], 
it is shown that the residual error provides the required properties. 

In the following it is assumed, that there exists a bounded mapping L : 
X Y with a bounded inverse. For an integral equation of the second 
kind {L := XI — K) this holds at least for regular values of A and compact 
K. Assuming pn to be the approximation for the solution p of the system 
^P — 9^ l^he residuum E F can be defined as: 

r„ := g-Lpn = L(p-p„) = Le„. (4.2) 

With the assumptions mentioned above, the norm of the residuum can be 
casted into the form of an error estimator: 

Cl Iknily < llcnllx < C2||rn||y. (4.3) 



For the inhomogeneous HELMHOLTZ-equation the residuum reads as follows: 






da 

J 47ra \ a 

r 

/ pi/ca 

/(«)■!« 



‘-1 5a dp{y) 



for x' £ r with a= \x' —y\ . 



As stated in [5] for the GALERKiN-approach, the H^-Norm of the residuum 
can be used as an estimate of accuracy, although the real error is underes- 
timated. On the other hand, this quantity is well suited to act as an error 
indicator in an adaptive process. This is done by taking into account only that 
portion of the ’’error” contributed by a single element under consideration. 
One obtains an ’’element related” error indicator rjk according to 




Obviously, two integrations have to be performed, one to get the residuum 
and the other to compute its norm. Due to the complicated kernel functions 
the first one has to be evaluated numerically. Therefore also the second inte- 
gration requires numerical computation which, in practice, results in a large 
numerical effort. 

In order to reduce the numerical effort, a slightly different procedure is 
proposed here. Its application provides the residuum to be sufficiently smooth 
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at least in the range of one element. Thus a projection can be applied which 
maps the residuum onto a finite dimensional space X* containing the solution 
space Xn as subspace. The additional degrees of freedom are posed on the 
element edges right between two vertex nodes. This leads to a special kind 
of interpolation which also takes into account that the residuum vanishes at 
the nodes related to Xn (the vertex nodes for linear elements). 

This interpolation allows now the integration of the norm to be carried out 
analytically. Moreover, equation (4.4) has to be evaluated only at the addi- 
tionally introduced nodes. Thus, the numerical effort is significantly reduced 
such that the method becomes applicable in practice. 

4.2 Refinement strategy 

In various cases it is desirable for the collocation method to have only con- 
forming elements, i.e., no “hanging” nodes shall occur. This can easily be 
achieved when the elements were subdivided by halving the edges where the 
neighboring element containing the same edge is subdivided too. On the other 
hand, this method contains a certain drawback: if elements are refined such 
that all their edges are subdivided the refinement can easily extend to those 
regions that do not need any refinement. Hence, whenever possible only those 
edges should be subdivided whose residual value contribute considerably to 
the value of the error indicator. 

Therefore, the element related error indicator (4.5) has to be converted 
to an edge related one. This can be done, for example, by the instruction 

VEdgeix'i) = {r)Elem\ ( 4 - 6 ) 

jee ^1=1 

where 0 represents a set of elements containing point xj and Nioc denotes 
the number of residual points in the element under consideration. 

In order to generate the new mesh, first, a refining rate is defined as the 
ratio between the number of subdivided edges and their total number. Then, 
by bisection a threshold value for the error indicator is determined such that 
the desired refining rate is reached. Edges whose error indicator exceed this 
value are marked for to be refined. Now, the new mesh can be generated 
by subdividing all the marked edges. In order to prevent the vertex angles 
of the elements from degeneration, all elements have to pass certain tests. 
Those elements not passing this tests get further edges marked and the mesh 
generation process is repeated until all elements have passed. 

The adaptive multigrid method proceeding is demonstrated by an exam- 
ple. Here a sound-hard box is considered which is exposed to sound waves pro- 
duced by a single point source right in front of the face {x = 0.2a, y = 0.05a, 
2 ; = 0.95a in Fig. 4.2). The geometry is shown in figure 4.2 as well as the 
initial discretization which is made of only 10 linear quadrilateral elements. 
Note that this special discretization is completely unsuitable for such kind 
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of configuration. Now, the adaptive process is performed over several refine- 
ment steps in order to adapt the mesh to the given situation. The resulting 
mesh is also plotted in figure 4.2. Here, a strong refinement of the areas in 
the neighborhood of the source can be recognized. Moreover, an intensive re- 
duction of the mesh size at the edges becomes visible, whereby the so called 
edge- and corner-singularities are taken into account. So, it seems that the 
physical situation has been approximated quite well. To rate the efficiency 
of the adaptively refined mesh, a comparison with a uniformly refined one is 
done. Figure 4.3 shows the reduction of the defect or residuum (related to 
the right hand side) versus an increasing number of nodes. 




Fig. 4.2. Original discretization and mesh after 8 refining steps 




Fig. 4.3. Convergence for adaptive and uniform refinement 
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This figure clearly demonstrates the much better convergence rate of the 
adaptive method. Instead of approximately 2000 nodes for the uniform re- 
finement the adaptive one needs only about 80 to 90 nodes to achieve the 
same accuracy. Similar results are obtained when the reduction of the defect 
is plotted versus the computing time (as it is done in Fig. 4.4). Although 
the evaluation of the error indicator is a very time consuming process, the 
convergence rate of the adaptive scheme related to the computing time is 
much better than in the uniform case. The same accuracy is reached with 
the uniform refinement in about 900 CPU-seconds while the adaptive method 
needs only 8 CPU-seconds. 




Fig. 4.4. Adaptive versus uniform refinement related to computing effort 



5 Acoustic problems in time domain 

It is well known that time-dependent acoustic wave propagation phenomena 
in homogeneous media may be described by Kirchhoff’s integral equation. 
This integral formulation is especially advantageous when exterior radiation 
problems shall be considered. But for three-dimensional problems, its numer- 
ical evaluation via boundary element procedures needs high computational 
effort, and, hence, all procedures should be as efficient and also as stable as 
possible. 

For this reason, in this paper, both the method of point-collocation and 
the Galerkin method, proposed by many mathematicians for having better 
convergence, are considered [6]. Another important point of the investigation 
is the instability arising when choosing the time-step too small. Here, follow- 
ing an advice of Morino et al. [7] a new formulation of one of the boundary 
integral terms is derived and studied regarding to its efficiency. 
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Moreover, the use of adaptive meshes is studied with special regard to the 
problem of moving noise sources. A simple, but nevertheless effective error 
estimator is presented, and algorithms for local mesh refinements but also for 
its coarsening are developed. This results in the effect of a “moving” mesh 
which changes with time: a fine mesh in the neighborhood of a moving noise 
source and a coarser grid in a larger distance of it. 

The performance of these algorithms is demonstrated. Finally, a rather 
complex example, the noise distribution due to a fast moving noise source 
in a housing area protected against noise by walls, is solved. Results may be 
found in [6]. 



6 Analysis of different boundary element 
methodologies 

The Kirchhoff integral equation is a mathematical model of small-am- 
plitude acoustic waves propagating with the speed c through homogeneous 
media: 

t 

^[q{x,T)p*{x,^-,t-T) - pix,T)q*{x,^;t-T)] 



0 r 



+ P d/2x|dr 

Q 



(6.1) 



It is essentially a reciprocal relation between the acoustic pressure p and its 
flux q across the boundary F of the domain Q and the corresponding states 
p* and q* of a unit point source in the full space [8]. The jump coefficient 
d{^) is AQ/j^'k with the inner solid angle zii7, and 7 (x,t) denotes the density 
of sound sources in the interior of the domain Q. The mentioned full space 
solution is (r =1 X — ^ |): 



P*{x,i\t-T) 



47rr 



( 6 . 2 ) 









r V c/ or \ c/ 



dr 



where i? is the Dirac function. Then, taking advantage from the relation 
d r\ d 



the equation (6.1) is usually transformed to 



(6.4) 
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dr 1 
dnx 47rr 



^p{x, tr) + -p{x, tr) I dCc 
ot r 



d(i)pi^,t) - j 

r 

f I . 



1 

47t I ^ 



(6.5) 



where tr = t ^ denotes the so-called “retarded” time with the property 
p{x, tr) = q{x, tr) = 0 for tr < 0. Here, as a special case, the sources are 
considered to be concentrated at an interior point and to have a time 
dependent strength gi{t). This expression is also correct when the position 
is moving with constant speed. 



6.1 Comparison of Point Collocation and Galerkin Method 



The discrete form of equation (6.5) is developed by the usual boundary ele- 
ment techniques where interpolation functions as simple as appropriate are 
implemented on the elements Fe of a triangular surface grid and in equal time 
intervals At, i.e., ^j{x) bilinear in the space variable x and 6{t) linear in time 
t. Then, one obtains from (6.5) without the source term by collocation at all 
time steps tk,k = l(l)m 



n m n ^ .. 

j=l 1=0 j=i 




^l{tk—lr) 

cAt 




( 6 . 6 ) 



In order to obtain as many equations as unknowns, two methodologies are the 
most common ones: the point collocation method and the Galerkin method. 
In the first method, also collocation in space, i.e., at all nodes Xi of the 
shape functions has to be performed. This results in the following system 
(vi =\xi - X I) where ^j{xi) = i9ij: 



n m n « j 

4-Kd{xi)Y^p^(Pj{xi) - / —Oi{tku)V’jix)dr^ 

j=l 1=0 j=i p * 

i=o p 



(6.7) 



In the Galerkin approach, the final system is obtained by determining the 
weighted residual of the approximated system (6.6) where the above intro- 
duced shape functions ^i{x) on the boundary elements are used as weighting 
functions, i.e., for i = 1, ..,n: 
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0 = 



n « 

i=i i 

^ ^ f r f 1 

j J -HtkrWjiy)^i{x)drydr^ 

r r 

X v’j(y)<^i(»)drj,dra.| 



( 6 . 8 ) 





Fig. 6.1. Integration area and its subdividing for the point collocation method 
and integration area for the Galerkin method 



The main difference between both the methodologies is the number of 
integrations needed. Besides, for the considered three-dimensional acoustic 
wave propagation problems, the integration area is dependent on the dis- 
tance which a wave, started at has already traversed: for each integration 
point X, that area S\xi) has to be found where x is between the sphere with 
radius (/ — l)cZ\t and that with radius IcAt. Then, for the time step jjiAt, the 
integration area is the intersection of the spherical shell S^{xi) with “thick- 
ness” At and of the shape function’s support supp((^i) = {x \ (f^ix) ^ 0}. 
Hence, for numerical integrations as exact as possible, e.g., by the Gaussian 
quadrature, enough Gaussian points must be placed in the interior of that 
“integration area” and, moreover, this area has to be approximated adequate 
to the applied quadrature formula. Here, an appropriate subdividing in sub- 
triangles is performed (for details, see [6]). This task is not easy in the point 
collocation method, but much more difficult in the Galerkin method. There, 
the integration area for the variable x is the support of (pi and that for y the 
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above defined area spt{(pj) fl S\x) where here x is not a fixed collocation 
point but one of the Gaussian points in the support of (pi (see Fig. 6.1). 

It is known from theoretical analysis that boundary values are approxi- 
mated by both the methods with a convergence like 0{li^), but for results 
at interior points the convergence behavior of the point collocation method 
should be 0{h‘^) while that of the Galerkin method is expected to be O(h^). 
Numerical studies confirm this perfectly, but from an economical point of 
view, the numerical efforts or costs are much more important for the decision 
which method is “better”. In this aspect, for errors not smaller than about 
one per cent, the point collocation method is much better (Fig. 6.2), e.g., for 
determining an interior value with an accuracy of 5 %, the Galerkin method 
needs twice the CPU-time of the point collocation method. 




error at internal points 

Fig. 6.2. Numerical effort versus error for interior point results: Comparison of 
point collocation method and Galerkin method 



6.2 New BE realizations of Morino’s modified KirchhofF’s 
equation 

It is known from numerical studies that for too small time steps At the 
point collocation procedure (6.7) of equation (6.5) becomes unstable. In a 
mathematical analysis, Figuieredo/Morino [7] found some theoretical rea- 
sons and proposed to use the following transformation 
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/ 



dr \ d ( 
drix cr 








1 9r a / ^ 



(6.9) 



where ~^p{y,t - means the normal derivative of p only in its second 
variable ^ and, moreover, to approximate it by 



- ?) “ i; (K- - - K-< - 



Here, An has to chosen appropriately dependent on the boundary element 
discretization. Unfortunately, this modification has the defect that for cases 
where the collocation point yj^ is in the same plane as the triangular in- 
tegration domain, and rk —\ x — is small, the approximation does not 
give the correct value zero, resulting from = 0. To overcome this defect, 

backward or central difference quotients for can be used [6] where higher 
order quotients are to prefer since they have less numerical damping, e.g., 
the second order backward difference quotient: 



dr 



P 





1 

2Z\r 





r -h Ar 
c 




r -f 2zir 
c 



) 



( 6 . 11 ) 



Moreover, with these modifications of Kirchhoff’s integral equation, also 
small time steps At are numerically stable, but one has to take care that the 
resulting small integration areas (see the above explanation) are integrated 
exact enough. For studies of the numerically efficiency see [6]. 



7 Adaptive meshes for moving noise sources 

The efficiency of a numerical procedure batsed on discretizations of either 
the domain or the boundary is very much dependent on the mesh refinement 
strategy. This is especially important if the sources of waves are moving, since 
then the mesh has to be fine in the neighborhood of the source and may be 
coarse in a larger distance. But, in any case, a simple but nevertheless effective 
error indicator is essential. 
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7.1 An Effective but Simple Local Error Indicator 

From the literature, several types of error indicators are known also for bound- 
ary element procedures, mostly based on the residuum. Here, the error in- 
dicator is based on the often realistic assumption that the sought solution 
is smooth and its curvature does not change very much from element to el- 
ement. Hence, in the case of linear shape functions over triangular element, 
one determines the change of the slope when crossing the boundary between 
two adjacent elements. If this angle is essentially smaller than tt, both the 
adjacent triangular element are subdivided. Obviously (see Fig. 7.1), the nu- 
merical effort is much less for this simple error indicator than that when using 
a residuum based indicator. 




error (L2-norm) 

Fig. 7.1. Error indicator “slope change” and comparison of numerical effort 



7.2 Strategy For Refining and Re- Coarsening Grids Creating a 
“Moving” Mesh 

The needed subdivisions have to be performed appropriately in order to allow 
not only refinements but also the re-coarsening of the grid when the noise 
source producing the acoustic waves has moved away far enough. In the 
refinement procedure, all “neighbor” elements are marked which have been 
subdivided simultaneously, and only these “related” elements are permitted 
for a re-coarsening (Fig. 7.2). Obviously (Fig. 7.3), the numerical effort for 
solving a problem with the same accuracy is much less when adaptive grids 
are used. 
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refining re-coarsening 




Fig. 7.2. Example for a refining and re-coarsening strategy for triangular grids 




number of nodes 

Fig. 7.3. Comparison of convergence: adaptively versus continuously refined grids 



8 Conclusion 

The paper demonstrates some methods to improve the efficiency of boundary 
element methods in frequency domain as well as in time domain. Particularly, 
for both methods, the adaptive mesh generation turned out to be a very 
efficient tool to speed up boundary element computation and reduce the 
amount of storage, too. For large systems which, in practice, can hardly be 
avoided, the numerical effort can be reduced considerably by combining the 
adaptive process with a multigrid solver. Moreover, by means of the ILU 
near-filed smoother proposed here the convergence behavior of the multigrid 
method is improved especially for higher frequencies. 

On the other hand, due to the fully populated system matrix the required 
storage may be still too large for some realistic problems. Particularly in time 
domain, the distributed computation on a parallel computer [9] is well suited 
to overcome those difficulties to some extend. For the frequency domain, 
methods like panel- clustering [10] or the use of wavelet approximations [11] 
are very promising. 
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A Boundary Element Formulation for 
Generalized Viscoelastic Solids in Time 
Domain 

M. Schanz, L. Gaul, W. Wenzel, and B. Zastrau 



1 Introduction 

The boundary element method (BEM) provides a powerful tool for the calcu- 
lation of elastodynamic response in frequency and time domain. Field equa- 
tions of motion and boundary conditions are cast into boundary integral 
equations (BIE), which are discretized only on the boundary. The boundary 
data often are of primary interest because they govern the transfer dynamics 
of members and the energy radiation into a surrounding medium. Formula- 
tions of BEM currently include conventional viscoelastic constitutive equa- 
tions in the frequency domain. Herein viscoelastic behaviour of materials in 
time domain is implemented in a boundary element formulation. The consti- 
tutive equations are generalized by taking time derivatives of fractional or- 
der. Instead of generating a viscoelastic fundamental solution in time domain 
used by the first two authors before, the present approach uses an analyti- 
cal integration of the boundary integral equation in a time step. Viscoelastic 
constitutive properties are introduced after taking Laplace transform of an 
elastic- viscoelastic correspondence principle. The transient response is ob- 
tained through the inverse transformation in each time step and the elastic 
as well as a viscoelastic wave propagation in 3-d continuum are studied nu- 
merically. 

The transient behaviour of 3-d continua in time domain described by 
boundary element formulations currently is limited to elastic solids. Be- 
haviours of viscoelastic solids in frequency domain are treated effectively by 
BEM. Calculation of transient response via the frequency domain requires the 
inversion of Fourier transform. Since the frequency response is known only in 
a limited frequency range, truncation effects exist. On the other hand, direct 
calculations of the response of viscoelastic solids in time domain require the 
knowledge of fundamental solutions of viscoelastic materials. Such a solution 
can be obtained by means of an elastic- viscoelastic correspondence principle. 
In [8], the elastic fundamental solution was transformed into Laplace domain 
and then the elastic-viscoelastic correspondence principle was adopted. For 
a simple rheological material, an analytical inverse Laplace transformation is 
given which leads to the viscoelastic fundamental solution in time domain. 
For the implementation of this viscoelastic fundamental solution in a 3-d time 
domain BEM program, it is advantageous to perform the time integration in 
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a time step analytically. This was carried out in [15], but leads to a very com- 
plicated series solution. An alternative approach for obtaining a viscoelastic 
boundary integral formulation in time domain is given in [7] . The generaliza- 
tion of this approach for establishing constitutive equations with better curve 
fitting properties of measured data is the objective of the present study. 

It is known from linear acoustics [11] that time and space discretizations 
of BIE lead to instability, if the time step is smaller than a critical value. 
The present paper also displays this instability associated with numerical 
damping in elastodynamic response. 

An extension of the stability range and reduction of numerical damping 
are achieved by modified representation of terms in the fundamental solution 
for traction according to a suggestion by Figueiredo et al in acoustics [4]. 



2 Constitutive equation 



Decomposition of the stress tensor aij into the the hydrostatic part Sijakkl^ 
and the deviatoric part Sij yields 



(Tij — ^CFkk^ij ^ij 



where su = 0. 



The corresponding decomposition holds for the strain tensor £i 



_ 1 

£ij — ^^kk^ij ^ij 



where eu — 0 



( 2 . 1 ) 



(2.2) 



which results in the following two independent sets of constitutive equations 
for viscoelastic materials: 



^ ^ , d^ ^ „ d^ ^ n d^ 

(2-3) 

k=0 k=0 k—0 k=0 



N 



M 



More flexibility in fitting measured data in a large frequency range is obtained 
by replacing the time derivatives of integer order by fractional order [6]. The 
derivative of fractional order a is defined by 



d^x (t) 
dt^ 



1 d 
r{l-a)dt 



J 



0 < a < 1 



(2.4) 



OO 

in which r(l — a) = / e~^x~^dx is the Gamma function. The inverse 
0 

operation of fractional integration is attributed to Riemann and Liouville 
[13]. A different definition based on generalized finite differences is given by 
Griinwald [10] as 
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lim 

N^oo |v ^ r (-a) r (j + 1) 




(2.5) 



This discrete definition is more convenient in representing constitutive equa- 
tions by time stepping algorithms, and can be shown to be equivalent to the 
definition given in equation (2.4). The fractional derivatives in equations (2.4) 
and (2.5) appear to be complicated in time domain. However, their Laplace 
transform reveals the useful result 



( 2 . 6 ) 

where s is the Laplace variable, and vanishing initial conditions are assumed. 

With the definitions (2.4) and (2.5) the generalized viscoelastic constitu- 
tive equations are given by 



N 



Y^p'k 









k=0 









N 










fc =0 






(2.7) 



For N — M ■= \ and integer time derivatives, a corresponding three para- 
meter rheological model with two springs and a viscous dashpot is shown in 
Fig. 2.1. 



E2 




This is one of two possible representations of a uniaxial stress strain equa- 
tion 



d d \ 

p-, + „ = E[e + ,~ej 

with 

F‘2 El E‘2 El 

V = E = a = — . 

E\ -h E2 El -j- E2 E2 



(2.8) 



(2.9) 



The use of fractional derivatives into the model leads to the second possible 
representation of the generalized model with five parameters 




34 



M. Schanz, L. Gaul, W. Wenzel, B. Zastrau 



According to [2], a credible model of the viscoelcistic phenomenon should 
be able to predict non-negative internal work, a non-negative rate of energy 
dissipation, and the viscoelastic wave speeds 



Clv — 



/a:(o) + |g(o) 




( 2 . 11 ) 



The initial modulus of the relaxation functions K (0) and G (0) have to be 
finite. These are fulfilled, if the orders of fractional derivatives are the same 
for all terms [14], i.e. 

a = /3, (2.12) 

which is in accordance with experimental data. Setting a equal to /? and 
satisfying the restrictions mentioned above, we arrive at the constraints [14] 



E >{)q> p 
g > 00 < a < 2 

p > 0 



and the viscoelastic wave speeds become 



Clv — 



Pk ^ Pg 




(2.13) 



(2.14) 



If the same damping mechanisms are assumed in hydrostatic and deviatoric 
stress strain state, the corresponding 3-d constitutive equations are obtained 
by replacing the uniaxial stress and strain by the hydrostatic and the de- 
viatoric states. A powerful tool for calculating viscoelastic behaviour from 
a known elastic response is the elastic- viscoelastic correspondence principle. 
According to this principle [5], the viscoelastic solution is obtained from the 
analytical elastic solution when the elastic moduli in the Fourier or Laplace 
transformed domain is replaced by the transformed impact response functions 
of the viscoelastic material model. The viscoelastic solution is then obtained 
by the inverse transformation. 

The elastic- viscoelastic correspondence for the above mentioned general- 
ized three parameter model is given by 






1 -f QkS^^ 
1 -\-PkS^^ 



2G 2G 



1 -f 
1 -f 



(2.15) 



in which K is the elastic bulk modulus, G the elastic shear modulus, and the 
transformation property (2.6) is used. 




A BE-Formulation for Generalized Viscoelastic Solids in Time Domain 



35 



3 Elastic BE-formulation 



The boundary integral equation of elastodynamics in time domain is recalled 
for completeness. The Lame field equations of a homogeneous isotropic elastic 
domain Q are given by 



(c'l - 4 ) 






"h ^2 — Uj 



(3.1) 



with displacement Uj(x,t) and wave speeds 



2 K + ^G 

4 = — 






(3.2) 



The corresponding traction and displacement boundary conditions over the 
boundary F are 



ti (x, t) = (TikTlk = Pi (x, t) 
Ui (x,t) = Qi (x,t) 

and the initial conditions are 



X € Ft, 

xe Fu with F = Ft U Fu (3.3) 



Ui (x,0) =uoi (x), 

iii (x, 0) = voi (x) xe n. (3.4) 

The 3-dimensional Stokes fundamental displacement tensor of the Lame equa- 
tion (3.1) under the body force 6j(x,^,t,r) = S {t - r) S {x - ^) ej in an 
unbounded space is given by (e.g. [6]) 



Uij{x,^,t,r) = 




where r = y/riri , u = Xi — ^i is the Euclidian distance between the field 
point X and the load point The capital H denotes the unit step function, 
and S the Dirac distribution. The corresponding fundamental stress vector 
components are obtained by substituting (3.5) into the constitutive equation 
and adopting Cauchy’s stress formula with the outward normal Uk 



go{r) iL. 



.2 r 
2 



47T 



+ 91 (r) 



H[t-T--] -nit-T-- 
Cl / V C2 



r \ 



8 t — T -j 8'[t — r 



Cl J Cl 



Cl 
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+ 92 (r) 



5 it — T — 



C‘2 



C2 






02 



+ 93 (r) 
+ 54 (r) 




(3.6) 



where ~ ~ denotes the derivative of the Dirac distribution with 

respect to the argument (t - r - The functions depending on spatial 



coordinates are abbreviated by 

g 

90 (0 = ^ {niTj + Ujr^i + dijr^kUk - 5r^irjr,knk) (3.7) 

9.M = -(i-2(^)')^ (3.8) 

9'^ W ^ + ^a'^.knk) (3.9) 

2 

93 (r) - {rurj + nj-r.j + SijUkr^k ~ Qr^rjr^knk) (3.10) 

2 

94 (r) = r^irjr^kUk. (3.11) 



The dynamic extension of Betti’s reciprocal work theorem [9] combines 
two states of displacements and tractions, {uij,iij) and (ui^ti) respectively, 
and leads to the integral equation 

CfjUj — ^ ^ij ^ dr H" J' Q ^ bj “I" UijVQj UjjUoj j df2 

r Q 

(3.12) 

where Cij = 8ij /2 for a smooth boundary. The * denotes the convolution with 
respect to time and is defined by 






0 

t 

f — t)W (r) dr 
0 



t<0 
t > 0. 



(3.13) 



The integral equation (3.12) reduces to a boundary integral equation if the 
body forces bj and the initial conditions vanish. Discretization of the bound- 
ary integral equation in the spatial and time coordinate leads to the bound- 
ary element formulation. Only the time discretization by n equal At steps 
is discussed here. The simplest nontrivial choice ensuring that no terms are 
dropped out in the boundary integral equation is using linear shape functions 
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for the displacements Ui and constant shape functions for the tractions U in 
time domain 



u, (x,r) = r,t (x) (3.14) 

(x, r) = 1 • T/j”* • (x) . (3.15) 



The spatial shape functions for the E elements are denoted by (x) and 
(x). The nodal values are for the corresponding boundary el- 

ement Fe at time tm = mAt. Substitution of equations (3.14) and (3.15) 
reduces the boundary integral (3.12) to 



t 

j j [U (x, r) • Uij (x, - iij (x, - Ui (x, r)] dF dr 

0 r 

= tttf f[ »«(x,^,t-r) -I- nt i^)-TP 

€=1 1=1 m=ld f 

i e — 1 






ki {k,u- r) ■ nt (x) • ( + t/r“ - 



tm T 



At 



dr dr. (3.16) 



Time integration of equation (3.16) can be carried out analytically because of 
the time dependency of the Dirac and Heaviside functions. This integration 
leads to piecewise defined functions [16]. For the sake of brevity, the integral 
of the first term on the right hand side of equation (3.16) only is given here, 



tm 

/ 



tm — 1 



Uij (x,^,f - t) dr = 



ro 



1 

4itq 



fo (r) I [(* - tm-iY - j + /i (r) ^ 
fo (r) [ttm - ttm-l - ^ + 



fair) 

0 



^[(ty 



(< - tm) 



+ /2(r)^ 



t < tm-l + ^ 

tm-X + + t 

t < ^ < ^ 

tm-X + t < ^ + t 

tm + f^<t 

(3.17) 



with the following abbreviations depending only on spatial coordinates: 



fx (r) = ^ 
r 
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The time intervals are valid only for the time step At < ^ After 

substituting tm = mAt^ tm-i = (m — 1) At and t = nAt in equation (3.17), 
it can be seen that the functions depend on (n — m). This implies that these 
functions depend only on the difference between the observation and the 
excitation time. 

After time and space discretizations, a system of algebraic equations [1] 
is obtained 



1 ( 1 ) 

jl+T 



(n) 2-4 



u 



m=l 



E u t , 

m=l 



(3.18) 



(m) (m) 

where I is the identity matrix, T and U are the influence matrices of 

(m) (m) 

stresses and displacements at the time step m. The vectors u and t contain 
all nodal displacements and tractions of the time step m. 



4 Viscoelastic BE-formulation 



In order to obtain a viscoelastic boundary integral formulation from the elas- 
tic formulation (3.18), the elastic-viscoelastic correspondence principle (2.15) 
is applied. This requires the Laplace transformation of (3.18) first. The kernels 
of the matrices consist of the function (3.17) and the fundamental solutions 
of the tractions after time integration. For the sake of brevity the procedure 
of deducing a viscoelastic formulation is presented for the integral (3.17) only. 
More details can be found in [14]. 

The one sided Laplace transformation 

oo 

£(/) = F(s) = I f(t)e-^^dt 

0 

x-\-ioo 

/W = ^ / (4-1) 

x—ioo 



leads to the transformation of equation (3.17) 

oo ^ tm 

j ^ J Uij (x, ^,t — T) dTe~^*dt 



0 






= EJ / fo(r)\ ((* - - (^) j + (-) 
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+ 



/ /oW 







t ^m—l 




‘-m-1 

2 






+ 







/o(r) 




2 2 2 




+ /2 (r) 






• (4.2) 



After integrating the time interval of (4.2), the elastic constants are replaced 
by the viscoelastic impact response functions. The elastic bulk and shear 
moduli appear only in the compression wave speed c\ and the shear wave 
speed C 2 . Thus the corresponding viscoelastic expressions 



(c;.)' = - 



1 + 



1 +PkS°‘«: 
1 + 






^ 4^ 1 + gGS«g \ 

3 l+PoS“Gy 



(4.3) 



are inserted into equation (4.2). After taking Laplace transformation of equa- 
tion (4.2), this leads to 



^ / w«(x,^,i-r)dT 

m=l \ 

1 H_c 1 

+/i W + h W r- 

5 5 





e 






(4.4) 



Causality of the solution implies that no response is present prior to the 
arrival of the viscoelastic waves (2.14). This physical requirement is assumed 
for the numerical inversion of equation (4.4). The correspondence relation 

F(s)e-**”*-i - F(s)e-*‘"‘ / (f - t^-i) H{t- tm-i) - f {t - t^) H{t- f„) 

(4.5) 

is taken into account as well. The inverse transformation therefore leads to 
a piecewise defined function in the time ranges of equation (3.17), where ci 
and C ‘2 are replaced by c\y and c* 2 t;- The inversion is carried out numerically 
by the method of Talbot [17]. 



5 Numerical example 

The propagation of waves in a 3-d continuum has been studied by the present 
boundary element formulation in time domain for two examples. They involve 
a 3-d pressure bar and an elastic foundation on a viscoelastic soil. 
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5.1 3-d pressure bar 

The problem geometry and the material data (corning glass at 550^(7) are 
shown in Fig. 5.1. 




E 

Q 

P 

Q 

V 

a 

p(*) 



2.075 ■ 10® jgr 

1000 ^ 

3.5 



3.5/5/10/26 s' 

0/0.25 

0.635/1 



Fig. 5.1. Step function excitation of a free-fixed bar 



The free end of the bar is excited by a unit step loading function H {t) 
from a pressure pump. The opposite end is fixed at the nodes. The use of 
different spatial shape functions and different meshes are studied first. The 
three discretizations are shown in Fig. 5.2. The time step size At was chosen 
close to the time the viscoelastic compression wave c\v takes to travel across 
the largest element. In Fig. 5.3 the displacements and tractions at the middle 
point of the free end (observation point) in longitudinal direction are plotted 
versus time. 

It can be clearly seen in Fig. 5.3 that a finer mesh or spatial shape func- 
tions of higher order improve the result. Also, it is shown that the meshes for 
the tractions are too coarse. 

As a second test, the results of the current BEM formulation are compared 
with a Finite Element (FE) formulation. In this test, the parameter a of the 
fractional derivative must be set to 1, because the authors have no available 
FE program for calculating viscoelastic response using fractional derivatives 
in time domain. 

The FE discretization and the comparison of results are shown in Fig. 
5.4. The displacements at the observation point in longitudinal direction are 
compared. Mesh B is used for the BE formulation. The comparison shows a 
good agreement of both formulations. 

In Fig. 5.5, the behavior of the travelling waves in the bar is studied for 
several values of the damping parameter q in the constitutive equation (2.8). 

The increase of damping parameter q stiffens the solid and increases the 
wave speed, resulting in shorter travel times and smaller deflections as shown 
in Fig. 5.5. 
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mesh A 

linear shape function 
with 40 boundary 
elements, 42 nodes 
time step At = 0.00052 

observation 

point 



s 




mesh B 

quadratic shape function 
with 40 boundary 
elements, 122 nodes 
time step At = 0.00052 s 




mesh C 

linear shape function 
with 90 boundary 
elements, 92 nodes 
time step At — 0.00050 s 



Fig. 5.2. Different meshes of a free fixed bar 




displacement u/m 
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5.2 Foundation slab on soil 

As a second example an elastic concrete foundation slab on a viscoelastic sand 
soil is considered. In the bonded contact area of the two domains continuity 
in stresses and displacements is assumed. 

The problem geometry and the material parameters are shown in Fig. 5.6. 




Fig. 5.6. Elastic foundation slab on viscoelastic soil 



For the discretization (Fig. 5.7) of the surface of interest, linear shape 
functions are adopted. The time step is chosen such that the viscoelastic 
compression wave with velocity ci^ is nearly propagated over one element 
during a time step. The element diagonal is selected herein as characteristic 
length. 




Fig. 5.7. Discretization of foundation slab and soil surface 
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The displacements us perpendicular to the halfspace surface are plotted 
versus time for different values of the viscoelastic parameter q in Fig. 5.8. 




time t/s time t/s 

Fig. 5.8. Displacement us at the points A and B 



The figure on the left hand side presents the displacements at point A 
and the figure on the right hand side the displacement at point B. Similar 
to the response of the pressure bar discussed in section 5.1, the material is 
stiffened with increasing value of q. This is observed by smaller deflections 
and a shorter period of vibration. The propagation of waves on the surface 
is depicted in Fig. 5.9, where the magnitude of the displacement field at the 
nodes is shown. 



6 Stability and numerical damping of the elastic 
solution 

The time stepping algorithm (3.18) in conjunction with the viscoelastic for- 
mulation leads to instability of the solution, if the time step At is chosen too 
small. This problem can be observed by calculating the wave propagation in 
an elastic free-fixed rod made of steel as shown in (Fig. 5.1). The load func- 
tion, geometry and spatial discretization by 40 linear elements are shown in 
Figs. 5.1 and 5.2. The longitudinal displacement of the centroid of the free 
end is plotted versus the time step number n in Fig. 6.1. 

If the the time step size At is chosen below a critical value, an instable 
displacement response occurs as shown in Fig. 6.1. This behaviour differs 
from the time marching schemes applied to FE equations. 
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t = 0.0034 s 




t = 0.01 19 s 




t = 0.0170 s 



displacement u/m 

5.71 E -09 

4.90 E -09 



4.08 E -09 m 

m 

M 

3.27 E -09 



2.45 E -09 



1.63 E -09 



8.16 E -10 m 

Am 



m 

9.93 E -14 “■■■■ 



Fig. 5.9. Displacement field of propagating waves on the surfaces of foundation 

slab and halfepace 
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★ 10-5 * 10-5 




time step n time step n 

Fig. 6.1. Stable (At = 0.002699 s) and instable (At = 0.002698 s) rod 

displacements 



The decay of the displacement amplitude as shown in Fig. 6.2 is due to 
numerical damping, because the material is elastic. For comparison purposes, 
the analytical 1-d elastic solution 






Hit- — 



c = 




associated to the 3-d problem is plotted in Fig. 6.2. The numerical damping 
can be distinguished for different values of At. The reason for the numerical 
problems of instability and numerical damping is due to the term in equation 
(3.6) which contains the derivative of the Dirac distribution. It has been 
shown for the Kirchhoff Integral Equation, representing the wave equation of 
linear acoustics [4] , that instabilities can be partly removed by approximating 
the derivative of the Dirac distribution by finite differences. In the present 
paper, this idea is adopted and modified for the application in elastodynamics 
by using finite differences of higher order. Equation (3.16) contains integrals 
of the type 



_ 1 ^ 

At 






rdr 



iS-1,2. 



(6.2) 



These integrals unfortunately cause the above mentioned problems in equa- 
tion (3.16). A model problem with the structure of equation (3.16) treated 
by [4] indicates the possible loss of stability. The problems can be partly cir- 
cumvented by replacing the derivative with respect to the argument by the 
spatial derivative 
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time step n 




Fig. 6.2. Rod displacement of 3-d problem with numerical damping compared 
with 1-d analytical solution 




While a finite difference of first order is used in equation (6.5), further im- 
provement can be obtained by using higher order finite differences. Results 
of the following finite difference representation are shown in Fig. 6.3 
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first order backward 









central 



second order backward 



— C/?— 5 \ t — T 






3S(t-r - — - 4S(t - T - -\-S (t-T - 



time step n 



first order backward 

second order backward 

— central 



Fig. 6.3. Rod displacement with reduced numerical damping 



The central difference leads to poor results, however, the backward difference 
scheme improves the computational behaviour. The new parameter Ar of 
the finite difference has to be properly linked to the space discretization, and 
does not depend on the time step size. Too large values of Ar lead to very 
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stable but damped results, and too small values tend to give instable but 
undamped results before instability occurs. Therefore, stable and undamped 
results can be obtained by properly choosing At and Ar. 



7 Conclusions 

Formulations in elastodynamics by BEM have generally been used to treat 
viscoelastic solids in Laplace or Fourier domain in which elastic moduli are re- 
placed by the corresponding viscoelastic moduli. The calculation of transient 
response in time domain involves inverse transformations [12]. The accuracy 
of the solution depends strongly on the limited number of available frequen- 
cies. To avoid this problem, the present paper uses the BEM formulation in 
time domain and converts the elastic formulation to that for viscoelastic ma- 
terials. This method turns out to be advantageous to convert the formulation 
after integration over a time step At rather than converting the fundamen- 
tal solution itself. The conversion is done by applying the elastic- viscoelastic 
correspondence principle in Laplace domain. The inverse transformation in 
each time step is performed numerically and thus allows one to account for 
general constitutive equations involving fractional time derivatives. The wave 
propagation in two viscoelastic continua are presented by BEM in time do- 
main as illustrative examples. The material behaviour is described by a three 
parameter rheological model, generalized by two fractional time derivatives. 

Different from what is known in time marching schemes used in FE equa- 
tions, too small time steps may lead to instability in the discretized BIE 
of elastodynamics of viscoelastic materials. The effect is shown to be more 
pronounced for elastic constitutive equations. 

After having replaced the derivatives of Dirac distributions in the traction 
fundamental solution by finite differences, the stability range was extended to 
smaller time steps. A generalized application of using higher order backward 
finite differences improved the stability range and significantly reduced the 
effect of numerical damping as well. 
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On the Efficient Realization of Sparse Matrix 
Techniques for Integral Equations with Focus 
on Panel Clustering, Cubature and Software 
Design Aspects 

W. Hackbusch, C. Lage, and S.A. Sauter 



1 Introduction 

The boundary element method (BEM) is an elegant tool for solving elliptic 
boundary value problems numerically. First, the method of integral equations 
is applied transforming the PDE on a domain Q into an integral equation 
defined on the boundary of i7. This integral equation can be discretized by 
Petrov-Galerkin methods defined on the surface of Q. Instead of the dis- 
cretization of the whole domain, only the lower dimensional boundary of i7 
has to be partitioned into a FE-grid. This is one of the major advantages of 
the BEM. Especially for 3-d problems, grid generation of the whole domain 
Q in many cases is still an extremely time consuming step. As a consequence, 
the dimension of the stiffness matrix is much smaller as for the correspond- 
ing FE-discretization. Furthermore, the matrix condition number is smaller 
compared to the FEM-system governing the convergence speed of iterative 
solvers applied to the linear system. On the other hand, the major drawback 
of the BEM is that the arising system matrix is full and, in addition, the com- 
putation of the matrix entries requires the evaluation of complicated surface 
integrals. Recently, many attempts have been made to overcome these two 
difficulties. The full matrix can avoided by representing the integral operator 
on the discrete level in an alternative form, which is based on the approxi- 
mation of the kernel function of the integral operator. The panel clustering, 
originally developed by Hackbusch and Nowak (see [9], [7]) for the colloca- 
tion discretization of the Laplace equation, turned out to be also applicable 
to any Petrov-Galerkin method, where we emphasize the important case of 
the Galerkin method where test and trial space coincide. A related method 
is the multipole expansion of Rokhlin (see [15]) which, however, can be ap- 
plied only to Nystrom discretizations and not, e.g., to the important case of 
the hyper singular formulation of integral equations. A further approach in 
this direction was presented by Brandt and Lubrecht in [4], [5] where multi- 
grid techniques together with high order inter-grid interpolation is used to 
condensate the system matrix. 

Another method to condensate the system matrix is the application of 
wavelets to boundary integral equations. It turns out that, by using a suitable 
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wavelet basis, the off-diagonal entries of the system matrix have a strongly 
decreasing behaviour with increasing distance from the diagonal (see [6], [20]). 

All condensation strategies have in common that the nearfield matrix en- 
tries, i.e., those elements lying close to the diagonal have to be computed in 
the standard matrix-oriented way. As mentioned above, this is a non-trivial 
and time-consuming task in BEM computations. The product of two basis 
functions with a possibly very complicated kernel function having a singular 
or nearly singular behaviour has to be integrated over surface pieces. Due to 
the singular behaviour of the kernel function, standard cubature techniques 
lose their accuracy. We will show that, for kernel functions arising by dis- 
cretizing boundary value problems via the BEM, the application of suitable 
coordinate transforms enables us to integrate the variable containing the sin- 
gularity analytically resulting in an integrand which has no or significantly 
reduced singular behaviour. For hyper singular integrals, we write the arising 
regularization in a form which can be evaluated in many cases analytically 
resulting in surprisingly easy formulae. 

In this paper, we discuss thoroughly the software design aspects which 
came up by the realization of the developed numerical algorithms. Especially 
we explain how the complexity of implementing the BEM can be managed 
by object-oriented design methods. Also emphasize is taken on parallelization 
strategies. 

The paper is organized as follows. In the next chapter, we go briefly 
through the setting of Petrov-Galerkin discretization of boundary integral 
equations and introduce the notations necessary for the sequel. Then, in 
Chapter 3, we investigate the computation of the surface integrals arising 
when computing the matrix entries of the linear system. First, we have to pro- 
vide some analytical properties of the arising kernel functions which then will 
be used in the design of eflicient cubature strategies. (Note that in more than 
one dimension the terminus quadrature is replaced by cubature). In Chapter 
4, we explain the panel clustering algorithm. After having developed the basic 
principle, the algorithm is formulated and an error analysis is presented. Esti- 
mates of the asymptotic complexity of the algorithm follows and remarks on 
their relevance for practical problem sizes are given. Chapter 6 is devoted to 
the design of modern BEM software in C4-4-. It will be explained which data 
structures are well suited in order to make the different algorithms eflicient 
and flexible with respect to different kinds of integral equations, discretiza- 
tion schemes as, e.g., collocation or Galerkin BEM, geometries and cubature 
techniques. 



2 Setting and Preliminaries 

Let us consider an elliptic boundary value problem on a domain i? C of 
order 2m of the form 
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Du = 0, 


in n 


(2.1) 


Bu = r. 


on F := dn 





with a system of m boundary differential operators Bj having mutually dif- 
ferent order. If i? is unbounded, suitable radiation conditions have to be 
imposed. We assume that the operator D has constant coefficients and the 
fundamental solution defined by 

DS = So, inM^ (2.2) 

S satisfies the radiation conditions, 

is known explicitly. Here and in the following, Sx denote the Dirac-functional 
located at the point x. In order to exhibit the principal ideas, it is sufficient 
to make the following assumptions, while we state that all presented methods 
can be applied to the general situation without any significant modification. 

- the space dimension d equals 3, 

- the order 2m of the operator is 2, 

- B = {d/dnY with j is either 0 or 1, whereas d/dn denotes the normal 
derivative, 

- the surface F is Lipschitz continuous. 

Using the (direct) method of integral equations, problem (2.1) is trans- 
formed into an integral equation which, in the variational formulation, is 
given by seeking u £ X such that 

(Au, v) + {Kiu, v) = (Ksr, v) , (2.3) 

is satisfied. Here, X, Y are suitable function spaces and (•, •} is a dual pairing. 
The operators K\ and K 2 denote integral operators which will be specified 
later. The function A is piecewise constant on F. The reason for presenting the 
formulation in this abstract setting is that most of the theory and algorithms 
discussed in the sequel can be presented in a more compact form as if dis- 
cussed for each choice of the spaces and pairing above separately. The Petrov- 
Galerkin method is characterized by defining finite dimensional subspaces Xn 
and Yn of X and F, i.e, finding Un G X^, such that (2.3) is satisfied for all 
Vn ^Yn. The definition of the discrete spaces is typically based on a geomet- 
rical partitioning of the surface F. For this, let T/v := {Aj : 1 < i < N} be a 
partitioning of F into small surface pieces. Here, we assume that Tn is a trian- 
gulation in the sense that there exists a family of diffeomorphisms {Xi}i<i<iv 

mapping the master element, i.e., the triangle with vertices (0,0)^ , (1,0)^, 
and (1, 1)^, onto the elements We assume that satisfies the 

following conditions: 

r = UzA€Tn 

2. Ai n Aj is either empty, a vertex, a (curved) edge or a (curved) triangle. 
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3. h := max diam(Z\) < C diam(Z\), 'iA G 

A^Tn 

4. max diam(B) > Ch^ € T^. 

B is a ball contained in A 

The finite element space Xn is given by lifting polynomials of degree p 
defined on the master element onto the surface, i.e., 

Xn := {u € (r) : u o Xi U ^ polynomial of degree p for all l<i< N] . 

Let 0n = {xi : I < i < n} C F denote the set of uni-solvent nodal points 
having the property that the interpolation problem of finding u e Xn such 
that u{xi) = fi for all 1 < f < n has a unique solution for any given vector 
/ G C^. The example below illustrates typical choices of discrete spaces 
Xn^Yfi ^d dual pairings. 

Example 2.1. 1. The Collocation Method is characterized by the following 

choices. Xn is defined as explained above and Yn := span {Sx • x £ 0n}- 
The dual pairing is given by {w,5x) = w{x). Explicitly, (2.3) takes the 
form: find u G Xn such that 

A (x) u (x) + Ki [u] (x) = K ‘2 [r] (x) , Vx G 0n- (2.4) 

2. The Galerkin Method is characterized by choosing Xn as above, Yn = Xn 
and (w,u) = (tt;,u)o, where (-,-)o denote the L^-scalar product on the 
surface F. This results in seeking u £ Xn such that 

(Aw,t;)o + {Kiu,v)q = (K2r,v)o , Vv G Xn- (2.5) 

In the following we collect some properties of the integral operators Ki ^2 
which will be used in the next chapters. All integral operators appearing in 
the context of solving elliptic boundary value problem via the method of 
integral equations can be written in the form 

K[u]{x) =p.i. j k{x,y,y - x)u{y)dy. (2.6) 

Some explanations are necessary. The kernel function k {x,y,y - x) is a suit- 
able Gateau derivative (of order less or equal than 2m) of the fundamental 
solution defined by (2.2). The fundamental solution behaves singularly only 
for X = 0 and is smooth elsewhere. In the three-dimensional case, we know 
(cf. [12]) that it can be estimated as 

1-5 (x -y)\<C l|x - yf"'~^ , Vx,y e r with x # j/. 

Hence, it is natural to assume that 

|fc(x,y,j/-x)| < Cllx-j/ir’, Vx, y e r with X 7^ y, (2.7) 

where the order of the singularity s is an integer smaller or equal than 3. 
With respect to the first both variables, k is smooth in smooth parts of the 
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surface F and may jump across edges and corners. An important case is the 
case of flat patches of the surface. In such regions, it is natural to assume 
that k is constant in the first two variables, i.e., 

k{x,y,y - x) = k{x -y) . (2.8) 

For 5 = 2,3 the kernel function is not Lebesgue-integrable. We have to apply 
the concept of regularized integrals in the sense of Hadamard. Let {x) 
denote a ball with radius e about x and 7 be a surface piece of F. Then, the 
functional J is well-defined by 

Xy [u] (e, x) := / k{x,y,y - x)u{y)dy. 

It was shown in [22] that, for all kernels which arise by transforming elliptic 
boundary value problems into integral equations, the function J admits an 
expansion w.r.t e of the form 

Jy [u] (e, x) = A_i (x) e“^ -h Aiog (x) log e -(- Aq (x) -f o (1) . 

The finite part of such an expansion is given by 

p.f. Xy [u] (e,x) := Ao (x) . 

€ 

In order to exhibit the dependency of Aq on 7 and u we alternatively write 
A^ [u]. We define the regularized integral (2.6) by 

p.f. / k{x,y,y - x)u (y) dy := p.f. [«] (e, x) = Aq (x) . (2.9) 

J-y ‘ 

Now, all general assumptions are collected which will be needed for dis- 
cussing cubature methods for the arising integrals and approximation of the 
kernel functions by means of the panel clustering method. 



3 Cubature Techniques for the Approximation of 
Singular and Nearly Singular Surface Integrals 
Arising in BEM 

In this chapter, we present cubature techniques for collocation and Galerkin 
discretizations of boundary integral equations. We start with the abstract 
Petrov-Galerkin formulation. Let and denote a basis 

of Xn and Yn respectively. Any function u € is uniquely linked to a 
coefficient vector tx E by the basis representation 

n 

»=i 



(3.1) 
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The Petrov-Galerkin discretization is then equivalent to solving the linear 
system, 

{M-hK)u = f 

with n X n matrices M and K and the n- vector / defined by 

fi ■= {K2r,(fii). 

The computation of the mass matrix M is trivial, since no integral operator 
is involved and M usually is sparse. The computation of / is similar as the 
computation of K, thus, we restrict our attention on the approximation of 
the entries of K. In the following we write K instead of K\. In the rest of 
this paper we assume that 

1. the integral operator K arises by transforming an elliptic boundary value 
problem into an integral equation, 

2. the trial space Xn is a subspace of the (continuous) function space AT, 

3. Xn C (r) for hypersingular equations, i.e., s = 3. 

An important property of the application of K to basis functions E Xn 
is stated in the following 

Theorem 3.1. Let j he a simple connected surface piece which consists of 
some triangles ofT^. Recall the definition of the part-fini integral (see (2.9)) 
and the order s of the singularity of the kernel function (see (2.7)). Let 

[0i] (x) := p.f. / k{x,y,y -x)u{y)dy. 

Jy 

Then, 

{ C if s <1, 

Clog dist (x, dy) if s = 2, (3.2) 

Cdist”^ (x, dy) if s — 3, 

with C independent of x. 

For y = r and any s <3, 

\Ar[(l>i]{x)\<C, VxET 



is satisfied. 

Proof The proof of this theorem can be found in [13, 22, or 14]. 

In the following subsection, we consider cubature techniques for the col- 
location method. 
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3.1 Cubature Techniques for the Collocation Method 



From Theorem 3.1, it follows that it is problematic to use the collocation 
methods for strongly singular integral equations, i.e., s > 2, since, typically, 
one would have to evaluate [0^] at corner and edge points. However, esti- 
mate (3.2) does not necessarily imply that, for x E dj, the function [u] (x) 
is infinite. Typically for Cauchy-singular integral equations, i.e., s == 2, the 
function A^ [u] (x) tends to infinity as x approaches dj but remains finite at 
points X G 7. This behaviour is thoroughly analyzed and correction function- 
als can be computed (see [22] and [23]). In the cited papers, it was shown 
that it is always possible to split the integral into some point functionals and 
an integral having a weakly singular integrand. Then, either Duffy’s triangle 
coordinates or polar coordinates smoothes the integrand such that it can be 
treated efficiently with properly scaled Gaufi-Legendre formulae. Here, we 
do not go into the details but proceed with considering the nearly singular 
situation. The arising problem is to compute 



Ia 




k{xi,y,y - Xi)(j)i (x) dx. 



(3.3) 



where Xi denote a collocation point having a positive distance from the surface 
panel A, 

dist {xi, A) = S > 0. 

In this case, the part-fini integral reduces to the Rieman integral, since the 
integrand is regular. On the other hand, the nth derivative of the integrand 
behaves like O {n\h~^) resulting in a very poor convergence speed of Gaufi- 
Legendre formulae. Different attempts have been made to overcome this prob- 
lem. Subdividing the triangle in the direction of the singularity was presented 
in [1] and [2]. This method wats generalized by introducing variable cubature 
order in [21]. We present here a semi-analytical technique which was worked 
out in detail in [11]. It was shown that the integral (3.3) can be computed to 
any desired accuracy by computing integrals of the type 



L 



{m+SAr 

yj (^1 + <^a) H" ^2 -h 






dAu 



where 5 a denote the horizontal distance of X{ from A and 5x the vertical 
distance. By introducing shifted polar coordinates of the form 



u = (r — 5 a cos a) 



cos a 
sin a 



the integral can be written as 
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+ + l /»«! 



6^1 



S / 

irrO 






^fi(a)4*(5A COS a 

+ (5^ sin^ a + Sl 



(sin a, cos a) 
r* 

:^drda, 



where (*? *) 2 ire polynomials, R{a) is a smooth function and amin, 

Q^max ^ [“'7r,7r[. The inner integration can be performed analytically by using 



/ 



a Vr^T^ 



1 Pi^a,r) 

\/r2 + 52* 






log ^±r -h y/r^ 




where p is a polynomial of degree less or equal than -2±nz:3 ^ 

are some coefficients. It can be shown that the integrand of the outer inte- 
gration has a reduced singular behaviour and can be computed by properly 
scaled one-dimensional Gaufi-Legendre quadrature. A careful error analysis 
(see [11]) shows that the order of the angular integration has to be increased 
logarithmically with decreasing distance S and triangle size diamzi. 



3.2 Cubature Techniques for the Galerkin Method 



For the Galerkin discretization of BIE, the elements of the stiffness matrix 
are given by 



Ki, 



= <l>i (x) p.f. / k{x,y,y- x) (f)^ 

J SUpp</>t J supp</>, 



supp</>j 



{y) dydx. 



The support of the basis functions consists of a few triangles. It would be 
desirable to split the computation of Kij into a sum of integrals over pairs 
of panels Ax x Ay. This is not trivial in view of the regularization process 
involved with the part fini integral. From Theorem 3.1, we know that 

Ar [<l>j] {x) := p.f. j^k{x,y,y- x) {y) dydx 



is bounded and hence. 



/ <l>i (x) Ar [<t>j] (x)dx= / (j>i (x) ^ AAy [<t>j] (x) dx. 

^ Ay^SUpP<i>j 



Unfortunately, [(j>j] may contain strong singularities at corners and edges 
of A such that the sum may not be interchanged with the outer integration in 
general. The following concept of introducing an outer regularization process 
was developed in [14]. For this, let the (5-strip A^ be defined by 

A^ {x £ Ax \ dist (x,(9i7) < (5} . 
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Then, it was shown in the cited paper that 
4>i (a;) Ar [<t>j] (x) dx 

= Y] p.f. / / (f>i (x) k (x, y,y-x) <f>j {y) dydx (3.4) 

.4j,€supp,^j ' JA^\B,(x) ^ 



— : Ka X Ay (e) 



-p.f. p.f. / / <j)i (x) k (x, y,y-x) <f>j (y) dydx . 

S « Ja^ JAy\B^{x) 



= : K^xAy (e,5) 



First, we consider the approximation of the term p.f. p.f. In 

6 € 

[14], it was shown that this term vanishes for Cauchy-singular and weakly 
singular integrands and in any case if An Ay is either empty or a vertex, thus, 
we may restrict to the case s = 3 and An Ay is either an edge or a panel. 
Since A^ 0, we may introduce several simplification of the integrand and 
integration domain which leaves the limit unchanged. It can be shown that A^ 
may be decomposed into a sum containing integrals over rectangles em x (0, 5), 
where Cm denotes an edge of A. The triangle Ay may be replaced by a suitable 
square Qm- Furthermore the basis functions may be replaced by constant 
continuation of the traces (f)i |e^ vertical to Cm ^tnd the kernel function by 
its principal part. Finally, the (curved) triangles may be replaced by suitable 
fiat triangles. Thus, it is sufficient to develop techniques to compute 

■— P-f- P-f. I I (^) ^principal {U ~ {u) dydx. 

^ ^ Jerr.X{0,S) JQm 

We introduce relative coordinates hy z = y — x and define the polynomial 
bij (x,z) := ^i^rn (x) ^j,m {z + x), yielding 



pj 



L 



D{S,e) 



^i,j ^principal dzdx 



with a suitable four-dimensional domain D{6,e). It turns out that one in- 
tegration can be carried out analytically due to the polynomial character of 
bij. In many cases the remaining integrand can be computed analytically, 
too. This has been worked out in detail in [14] for the case that the principal 
part of the kernel function is given by 

d‘^ C 
duydux \\x - y\y 



i.e., the underlying differential equation is the Laplace or the Helmholtz equa- 
tion. The result is that for linear elements and triangles A, Ay with a common 
edge 
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IS := 1^.1^ 



9 vtanw 9 3 ^ 2 



)■ 



where u denote the (smaller) angle between the planes through A and Ay. 
The coefficient C{j is either 0, 1, or 2 dependent on the indices j. This for- 
mulae can further be simplified if the case A = Ay is taken into consideration. 
Here, we skip the details. 

Now, we investigate the approximation of the quantity K^xAy (^)- Since 
this integral is regular, we may use the usual rule of transformation of vari- 
ables. In [10], it was shown how KAxAy (e) can be approximated by a sum 
of integrals of the same form over flat triangles. Hence, we restrict to this 
situation and in view of (2.8), we assume that k{x,y,y — x) is constant in 
the first both variables on the integration domain D := A x Ay. This means 
that k(z) := k{x,y,z)^ where we skip the tilde in the sequel. We have to 
distinguish the following situation. 



1. The singular case: AC\ Ay ^ ib. 

2. The nearly singular case: dist (Z\, Z\j,) = > 0 but “small”. 

3. The regular case: dist (Z\, Ay) > C > 0 with C = O (1). 



These cases will be considered in the following subsections. 

3.2.1 The Singular Case: AC\ Ay ^ ^ 

Due to the space limitations of this paper, we consider here only the case of 
A = Ay, where the general situation is considered in [19], [10] and [14]. The 
restriction to the case that A lies in the (l,2)-plane is not essential but will 
simplify the notations. We consider the integral 

Q := / (j)i{x)k{x-y)(t)j{y)dydx. 

JAxA 

\\y-x\\>t 

The key point of the transformation procedure is the use of relative coordi- 
nates: 

y = ^ -f X. 

Let the polynomial h be defined by bij {x, z) := <j>i (x) (f>j {z -i- x). The integral 
above then can be written in the form 

/ bij (x, z) k (z) dzdx 

Jd 

M>e 

where the domain of integration is a four-dimensional polyhedron given by 

D := {{x,z) € I 3(x,y) eAxA:z = y-x). 

This polyhedron can be parametrized also in the opposite way such that 
X depends on 2 :. This means that the integration with respect to ^ can be 
interchanged with the x-integration. We obtain 
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Q 



-L 



k{z)bij {x,z)dxdz. 



Ikll>e 



Since 6 is a polynomial and D is a polyhedron, the ^-integration can be 
carried out analytically yielding a piecewise polynomial function Bi j {z) := 
f bij (x, z) dx. Splitting the remaining integration domain into subdomains 
where Bi j (z) is a polynomial and further affine transformations results in 

^ r 

Q=Y1 P-^- / ^ {z) dz, 

h\\z-Am\\>e 



where the functions are polynomials in The remaining integral can be 
computed using polar coordinates about Am - 




rdrda. 



It turns out that the r-integration can be valuated analytically very easily and 
the limit process, too. The integrand of the remaining angular integration is 
smooth and can be approximated efficiently using properly scaled Gaussian 
quadrature rules. 

3.2.2 The Nearly Singular Case 

In the case that dist (zi, zij,) =: (5 > 0 is small the convergence of standard 
cubature formulae becomes very poor. To overcome this problem, it is pos- 
sible to write the integral over zi x as a sum over domains containing 
the singularity and proceeding as explained for the singular case (see [19]). 
Here, we will explain briefly a further alternative which is very similar as de- 
scribed above for the collocation method (see [14]). Again the starting point 
is the introduce relative coordinates y — x = z and shifted polar coordinates. 
The r-integration can be done explicitly resulting in simple formulae for the 
primitive integrals having a reduced singular behaviour. The remaining three- 
dimensional integral can be approximated by properly scaled Gaufl-Legendre 
cubature, i.e., the order of integration again has to be increased logarithmi- 
cally with respect to the diameter of the triangles and the distance 6. 

3.2.3 The Regular Case 

In this case the kernel function has no singular behaviour. Standard cuba- 
ture techniques as tensor Gaufi rules may be applied having the well known 
fast convergence properties. However, the question arises how accurate the 
integrals have to be computed such that the overall asymptotic discretiza- 
tion error is not reduced. In [17] it was shown what degree of exactness the 
inner integration and the outer integration must have, dependent on the or- 
der of the integral operator, on the order of approximation, on distance from 
the singularity, and the norm in which the error is measured, such that the 
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asymptotic convergence order of the overall discretization is not influenced. 
In order to illustrate the results, let us assume that linear finite elements 
are employed, the distance S from the singularity is 0(1) and the integral 
operator K maps the Sobolev space onto The error is measured 
in the /f^-norm. The following table shows pairs of numbers, where the first 
one denotes the degree of exactness of the outer integration and the second 
one for the inner integration. 



Table 3.1. Required degree of exactness (7x>7y) in order to get an optimal 
discretization order with piecewise linear functions in the if^-norm. 



s 


IB9E9 


p = 0 




P = -l 




p = -2 


mm 


p=-3 


+5 


(2,1) 


(1.1) 


(2,2) 


(1,2) 


mm 


mm 


- 


- 




- 


(2,1) 


(3,2) 


(2,2) 






- 




biJ 


- 


- 


(2,2) 


(3,2) 


mwm 




(3,4) 


(2.4) 



4 The Panel Clustering Method 

The boundary element method leads to a system of linear equation which 
contains a full coefficient matrix of dimension n. The reason lies in the 
fact that the kernel function k {x^y^x — y) contains a term of the form 
||a: — y|| linking every point x £ F with a point y £ F. The panel clus- 
tering method was developed by Hackbusch and Nowak in [9], [8] for col- 
location discretizations and generalized to the Galerkin BEM in [19], [10]. 
The idea of the method is the approximation of A by a series of the form 
k {x^y^y — x) (x) (y) with a suitable function system 

Hence, the ^-integration is separated from the ^-integration reducing the 
amount of work substantially. The efficiency of the method depends on how 
fast the kernel function can be approximated by the function system 
i.e., the number of term in the series and on the question how efficiently 
the arising quantities can be computed and organized. This will be discussed 
in the following. We recall here the setting of Petrov-Galerkin methods for 
boundary integral equations as defined in Chapter 2 and the notation of the 
trial space by Xn = span {0i}i<^<^ and test space Yn = span 



4.1 Kernel Expansions 



The kernel functions appearing in the definition of integral equations are 
related to the singularity functions : -> M of the underlying boundary 

value problem. This function can be written in the form 



s 



kl>t 



Cu 



jy-^r 
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where i/ G Nq denote multi-indices and |i/| i/^ -h 1/2 -f- 1/3. Let zq yo - xq 

be “large”. Consequently, s{z) — s {y — x) is smooth in a neighborhood z G 
U (^o)- We expand s as a series of order m about zq: 

m—l u 

»w = EEv.- u{zo)uju{x)w^{y) + Rm{zo,z) (4.1) 

|i^|=OM =0 

with some coefficients k. The kernel function is a suitable Gateau derivative 
of s, where we restrict here to normal derivatives. Hence, we can use the 
expansion 



^ ^ / Q \OL\ f d 

k{x,y,y-x)^ S u^(x)^—J (y) + 

gai-ha 2 

Example 4 Let s{z) = ||;2;||“^ which is (up to a constant factor) the 
singularity function of the 3-d Laplace operator. Taylor expansion about 
^0 = yo - xq results in 



, , : ai'i , , 

H=o. 



{z Zo) + Iim(zo,z). 



Z=Zo 



— • Kit (zo ) 



Putting z = y - X, expanding (y - x - zq)*" and reordering the terms results 
in 

m—l u 
\ u \=0 n =0 

which is of the form (4.1). 



Remark 4 -R For highly oscillatory kernel as, e.g., for the Helmholtz prob- 
lem where s{z) = IkIL Taylor expansion is converging slowly. The 

situation can be improved by first performing a 1-d expansion in the radial 
direction, i.e.. 



^ikr 

r 



^ m-l t.j .1 
1 ^ ifcro 

I'- 

1=0 



(r - roY 



and then applying standard Taylor expansion to the right hand side above. 
The error term behaves like and hence, a minimal requirement is 

that the order of the expansion has to satisfy 



m> k{r — ro) . 



(4.2) 
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On the other hand the function system consists still of polynomials and the 
algorithmic apparatus developed for the Laplace operator can be used. An 
alternative to this expansion is the use of spherical Hankel functions, i.e., 
writing 

Akr ^ 

— = 51 “ ^o)) 

^ 1=0 

which has a better convergence behaviour. However, (4.2) is the minimal 
requirement for convergence, too. For details see [16]. The use of this function 
system for Petrov-Galerkin discretizations makes the development of new 
algorithms necessary in order to compute the required quantities efficiently. 

In order to define the panel clustering algorithm, we have to define ap- 
propriate regions on the surface F where the expansion above gives good 
approximation. 



Definition 4.1. Let Tn denote the panelization of the F as, e.g., the tri- 
angulations explained in the previous chapters. A ^‘cluster” is the union of 
one or more panels: r = Uj=i ^ij • ® cluster is given by the 

^‘cluster radius " p (r ) which is defined by the radius of the minimal ball con- 
taining T. The centre Zr of this balls is called ^‘cluster center . 



Definition 4.2. The relative distance of a cluster from the support of a basis 
function of the test space cpi € Yn is given by 



d{(pi,T) 



p{t) 

dist (supp(/?i, 2 ^^) ’ 



where for the Dirac functional 5x, ie., the collocation method, suppcJ^ = x. 



Our aim is to determine the size of clusters such that the replacement of 
the kernel function by the expansion on those clusters has an accuracy of a 
given value e > 0. In this light, for e > 0 and arbitrary expansion order m, a 
cluster is said to be “admissible” with respect to a basis function (pi E , if 



\k{x,y,y-x) - km {x,y)\ 



1 

~"\\x-yr 



'iy £r,x e supppi. (4.3) 



The size of the clusters has to be linked to the approximation property of the 
kernel expansion in the following way. 



Assumption 4.3. For given e and expansion order m, there exists 0 < < 1 

such that, for all I <i < N, the condition 



d(pi,r) <r) <1 
implies that r is admissible with respect to pi . 
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Definition 4.4. Let e and an expansion order m £ N be given. Let the 
relative size r] of admissible clusters be determined as explained above. A 
set of clusters with disjoint interiors is called a covering of F, if 

\Jl, Ti = r. 

A covering is called admissible with respect to a basis function cpi G Yn, if 
either 



d{(pi,r) <T} {admissible cluster) 



or 



T is a panel. 



The admissible covering which contains a minimal number of clusters is called 
minimal admissible covering Ci. The nearfield and the farfield are 
defined by 



Quear ^ | ^ non- admissible with respect to (pi) ^ 

c/'*'’ :={reCi\Ti cr**"} ■ 



4.2 The Algorithm 



Let K denote the matrix corresponding to Petrov-Galerkin discretization of 
the integral operator. Then, a matrix vector multiplication can be written in 
the form 






= {^i,K[u]) = J^k{x,y,y-x)u{y)dy) (4.4) 



with the dual pairing (•, •) of (2.3) and the correspondence (3.1) of a vector u 
and a FE-function u. Splitting the surface into the nearfield and the farfield 
part and using (4.1) shows that (4.4) equals 



lipi.p.i. / k{x,y,y -x)(f)j 

j=i \ 



{y)dy 



= 



+ ^ Ljk{x,y,y-x)uiy)dy 

n m— 1 u / / f) \ 

i-1 1 ^ 1=0 ^ 

f f d \ 






J^(u) 
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n m— 1 V 

= E EE (4.5) 

This is the panel clustering representation of a matrix vector multiplication. 
The coefficient k are called the expansion coefficient and J!;! {u) the farfield 
coefficient. 

The definition of the so-called nearfield matrix looks identical as 

the original matrix K. However, since the integral over vanishes if 

supp^j n = 0, the matrix is sparse dependent on the size of 

Qnear 

The panel clustering can be structured in the following way. 

Phase I: 

- Choose the accuracy e for the kernel approximation. 

- Choose the expansion order m and relative size of the admissible clusters 
Tj — Tj (e,m) such that the storage/ computational consumptions (which are 
known a priori) are minimal. 

- Organize the cluster in a binary tree. 

- For each basis function of compute the near- and farfield of the 
minimal covering. 

Phase II: 

- Compute and store the nearfield matrix the expansion coefficients 

Sind the basis nearfield coefficients . 

Phase III: 

- Compute the farfield coefficients (u) from the coefficients (0i) by 
using the tree structure of the clusters. 

- Approximate a matrix vector multiplication by (4.5). 



5 Error Analysis of the Panel Clustering Method 



The panel clustering method introduces a further error in the discretization 
process which can be treated similarly as, e.g., errors due to numerical cu- 
bature. In the present situation local analysis has to be done to determine 
the dependence of the accuracy of the kernel expansion on the size of the 
clusters and the order of the expansion. As a typical example, we consider 
Taylor approximation of the singularity function of the Laplace operator in 
3-d. For given e > 0 and expansion order m, the bound for the relative size 
of admissible cluster rj has to be chosen as in order to obtain 





Efficient Realization of the BEM 



67 



where Tm denotes the Taylor expansion of order m of ||^||“^ This estimate 
and also error estimates of Taylor-based expansions of general kernel func- 
tions are worked out in [18], [19], [9] and [14]. 

In the second step, the influence of these local errors (valid on the admissi- 
ble clusters) to the consistency error, namely a matrix vector multiplication 
with and without panel clustering, has to be investigated. In [9], [19] and 
[10] it is shown that, under moderate assumption on the surface and the 
triangulation, the estimate 

\{ipu (K - Kpc) u)| < Cejh {<Pi, 1) IImIIoo > ^ X„, 

is satisfied with equals 1 for weakly singular kernels, is O (log h) for 
Cauchy-singular and h~^ for hypersingular kernels. Kpc denote the panel 
clustering operator where the kernel is replaced by expansions satisfying As- 
sumption 4.3 and (4.3). 

5.1 Complexity of the Panel Clustering Algorithm 

Let N denote the number of panels of the panelization Tjsf. An asymptotic 
complexity analysis of the panel clustering method (cf. [9]) shows that the 
choice of m = [^logiVj and r] = const implies that, again under moderate 
assumption on the surface and triangulation, the number of nearfleld panels 
is bounded independent of N, while the number of farfleld clusters grows like 
O (log AT). Furthermore, the asymptotic amount of work for the initialization 
Phase II of the panel clustering algorithms is proportional to O (Nlog^ N) 
with r = 5 for the collocation and 4 for the Galerkin method. The evaluation 
Phase III requires O [N log^ N) with t = 4 for the collocation and t = 7 for 
the Galerkin method. The storage consumption for the collocation method 
are of order O (iVlog^ A^) and for the Galerkin BEM of order O (AHog^ N). 
The powers of the logarithmic terms are not ultimative. The algorithm can 
be structured such that Phase II becomes cheap and the storage consump- 
tions are low (relatively few quantities are computed) but Phase III becomes 
expansive since more preparation steps have to be performed and vice versa. 

The asymptotic gain of the panel clustering method is obvious compared 
to the standard matrix techniques where both the CPU-time and the storage 
consumption behaves like O (AT‘^) . Nevertheless it is important to consider 
the constants of the O(-) estimates in detail to study the amount of work used 
by the panel clustering algorithm for “small” problem sizes. As a test problem 
we examined the Galerkin discretization of the double-layer potential for the 
Laplace equation on the surface of the unit cube scaled by (3 : 1 : 0.3). For 
the panel clustering and the conventional matrix oriented version the amount 
of time and space required to assemble the corresponding discrete operator 
K satisfying a given relative error e, i.e., \\Ku - Ku\\o < elljFCtx||o with 
K denoting the exact matrix, was measured. The accuracy in the case of 
K = of the panel clustering was controlled by the parameters rj and m. 
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In order to adjust the accuracy oi K = of the matrix oriented version 
the size of the nearfield, where high order cubature formulae are used, and the 
farfield, where only one-point formulae are applied, was changed. The results 
for a problem size of AT = 1154 vertices (2304 panels) are shown in Figure 
5.1 illustrating the amount of time needed to assemble the operators and in 
Figure 5.2 depicting the number of entries to represent these operators. 




Fig. 5.1. Time consumptions assembling the discrete operators. 




relative error 

Fig. 5.2. Storage consumptions of the discrete operators. 
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The calculations were carried out on a SUN SPARC-10/40. It turns out 
that even for small problem sizes the panel clustering method could reduce 
the CPU-time and storage consumptions considerably. Further results are 
reported in [19] and [14]. 



6 Software Design Aspects 

Regarding the software activities for finite element methods (FEM) there 
are already complex and powerful packages to solve various problems. It 
would be quite convenient to reuse this software or at least the design aspects 
for the boundary element method as well. Though even the conception of 
FEM and BEM are nearly the same, the aspects concerning design decisions 
differ considerably. The system matrices of the FEM, for instance, are sparse. 
Adequate data structures for this type of matrices are lists or other containers 
taking the local character into consideration. However, the BEM leads to full 
matrices making it quite plain to use arrays. There are a lot of other examples 
like this showing that it is unavoidable to develop BEM software fi:om the 
beginning. 

The difficulty one has to encounter is the great variety of tasks: There are, 
among others, different discretization schemes like collocation or Galerkin 
methods, different kinds of operators (single layer, double layer, hyper singu- 
lar) and quite a lot of cubature techniques to consider. A design strategy 
which isolates the common parts and Tocus upon the essential characteris- 
tics’ [3] to build the basic abstract data types would be very useful. These 
properties are given by object-oriented methods known to be a suitable tool, 
to manage complex systems. In the following we specify an object-oriented 
approach to BEM which has been elaborated as a class library implemented 
in C4-h. 

In our situation the object-oriented decomposition could be extracted 
from the formulation of Petrov-Galerkin schemes introduced in Chapter 2 
The foundation of this scheme are the subspaces Xn and Yn defining the test 
and trial functions. A basic information necessary to construct these spaces 
is a description of the geometry, i.e. the surface of the physical object. For 
this purpose it is advantageous to have a (continuous) description which of- 
fers the possibility to resolve the geometry with any necessary accuracy - an 
important fact, especially in the case of multilevel methods where hierarchies 
of spaces must be generated. So we can state a first class of the library called 
Surface representing a formal description of the surface: 
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class Surface { class SurfaceApx { 

public: public; 

Surface(char* fname); SurfaceApx(const Surfaced sf, int level); 

In the next step, we require an approximation of the surface, i.e. a set of 
panels (triangulation) that meets several conditions listed in Chapter 2 To 
maintain this set, a second class, called SurfaceApx, hats to be introduced 
into the context of geometry abstraction. In the class definition above the 
parameter level is used to distinguish between particular levels of approxi- 
mation. Instances of the class SurfaceApx for different levels form a hierarchy 
of panels. 

To classify a basis of the subspaces two well-known alternatives are avail- 
able. The first one, the node-oriented version, characterizes a single basis 
function by its support in conjunction with a related node. The basis is de- 
scribed by the set of all basis functions. This definition is obvious and close to 
the discrete mathematical formulation of the Petrov-Galerkin scheme. Nev- 
ertheless the second, panel-oriented alternative is more efficient in respect of 
the time-consuming integrations. It uses the corresponding shape function for 
each panel, i.e., the set of all non-vanishing basis functions restricted to the 
panel. The abstraction of the entire basis is formed by the recombination of 
all shape functions. An entity, representing a single basis function, no longer 
exists. 

The properties of common shape functions are reflected in the definition 
of an abstract base class: 

class ShapeFunction { 



public: 

virtual const Panel&i support() const = 0; 
virtual int dimension() const = 0; 
virtual const int'*' index() const = 0; 

}; 

It gives information about the related panel, the number of restricted basis 
functions and their associated indices. By the mechanism of inheritance we 
can generate subclasses of ShapeFunction to module different bctsis functions, 
as constant, linear or dirac functions, on different types of panels (Figure 6.1). 

For the instantiation and maintenance of shape functions a further class, 
Space, is established. This provider is responsible for the correct combination 
of shape functions to form a basis and thus the desired subspace of the Petrov- 
Galerkin method; especially it has to support the management of conforming 
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indices. To realize various subspaces, several specializations of the base class 
Space are used. Analogous to the subclasses of Shapef unction we could 
define ConstSpace, LinSpace or DiracSpace. 



[shapeFunctionJ 




(constQuadSP) (unTriSP) [piracVtx3P 




Fig. 6.1. Specializations of ShapeFunction, 



The next class in our library addresses the dual forms of (2.3). Corre- 
sponding to the construction of spaces the abstraction of a dual form is a 
function that assigns to each pair of shape functions (^, an element ma- 
trix of the form 

{m)^^ {K'lj), 0 ) , (f) e xp 

Embeded in a class definition we get 

class DualForm { 
public: 

virtual const real’*' operator() (const ShapeFunction^ triaifnc, 

const ShapeFunction&i testfnc) const; 

}: 

Before the assignment can be executed an appropriate integration or cubature 
rule dependent on the particular type of shape functions has to be chosen, 
e.g., to distinguish between collocation and Galerkin schemes. For this pur- 
pose run-time type information (RTTI) is an essential tool and used in our 
implementation to solve this kind of multi-method problem (see [24]). The 
class DualForm acts as an interface class which ‘adjusts the appearance’ [24] 
for further class definitions derived from it. With these subclasses a variety of 
integral operators and associated integrations strategies could be arranged. 

To assemble local element matrices in a global one the class MatrixOp is 
introduced into the design. It is a specialization of the base class Operator 
(Figure 6.2) which first of all is responsible for the abstraction of integral 
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operators.This is done by defining a virtual member function of the class 
Operator common to all subclasses which declares the mapping of the con- 
sidered operator: 

virtual void operator()(const Function^ triaifnc, const Function^ testfnc); 



(operator) 



(MatrixOp) 




( PnICIstOp ) \ (oMResOp) 



( SparseMatrixOp ) (cGOp^ 



Fig. 6.2. Specializations of Operator. 



For the clctss MatrixOp for instance this function is to be overloaded by 
a simple matrix- vector product. The class Function in the definition above 
just represents a valuation of the related basis . It can be implemented by a 
vector of floats or doubles. 

The subclass MatrixOp is used to specify the integral operator on the 
discrete level by a full coefficient matrix. Another alternative offers the panel 
clustering method described in Chapter 4 Its complex algorithmic issues can 
be totaly hidden in the implementation of an additional subclass of Operator 
denoted by PnlClstOp. The external behaviour or usage resp. of PnlClstOp 
is nearly the same as for the simple MatrixOp. Only the information needed 
during the instantiation process must be extended: a specialized version of 
the class DualForm to describe the far field dual forms ipi) 

and d^^/driyWu{y)) as well as an instruction to calculate the expansion 
coefficients 

Even the solver of the linear system interpreted as inverse operator meets 
this context. For example overloading the operator () by the gmres algorithm 
leads to the class GMResOp. An instance of GMResOp is initialized with the 
operator that should be inverted. 

To summarize the rough introduction of our class library we present a 
short application: 

int main() { 

// initialize the geometry given in file “cube" 

Surface sf(“cube“); 

SurfaceApx sfapx(sf, 2); 
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// construct test and trial spaces for collocation 
DiracSpace test(sfapx); 

LinSpace trial(sfapx); 

// get the necessary informations about dual forms and 
// expansion coefficients 
DlpDF df(3); 

FarFieldDF ftstdf(2); 

FarFieldDrvdDF ftrldf(2); 

LaplaceExp exp(2); 

// assemble the stiffness matrix and the panel clustering operator 
MatrixOp op(trial, test, df); 

PnICIstOp pcop(trial, test, df, ftrldf, ftstdf, exp, l-e2); 

// initialize two functions; one matching the trial space, 

// another the test space 
Function f(trial); f = ‘ sin(x’*'y)" ; 

Function g(test); 

// calculate {{Kf^ ^i))j with both operators 
op(f. g); 

pcop(f, g); 

// solve using the panel cluster operator 
GMResOp solve(pcop, le-8, 100); 
solve(g, f); 

} 

Finally the approach to involve parallel architectures, in particular SPMD 
schemes (same program multiple data), should be mentioned. The most time- 
consuming task of the BEM is to assemble the matrices. A distribution of this 
task among several processors can be achieved by decomposing the subspaces 
Xn and Yn leading to a two dimensional torus as proper topology of processors 
(Figure 6.3). To be independent of the given physical architecture an interface 
class ProcessNode is created which establishs basic operations in the terms 
of the underlying torus (e.g. to broadcast along rows or columns, the upper, 
lower, left or right neighbour): 

class ProcessNode { 
public; 

virtual int broadcast(. . . ) = 0; 
virtual Int reduce(. . . ) = 0; 
virtual int shift(. . . ) = 0; 

}; 

Specializations of this interface class realize the connection between the phys- 
ical layer (hypercube, workstation cluster, etc.) and the application layer (2d 
torus). 




74 



W. Hackbusch, C. Lage, S.A. Sauter 



To make the information and operations of ProcessNode available to the 
concerned classes of the library a reference to an instance of ProcessNode 
could be used. The virtual function call ensures the expected behaviour. 
Parameterized classes (templates) offer another, more efficient way of em- 
bedding the derived classes of ProcessNode in the library. In this alternative 
the member functions broadcast (), reduce (), etc. are bounded statically 
resulting in an efficient function call. 





y1 v2 v-^ 



cTjn- 






y2 



physical layer 



application layer 



Fig. 6.3. Abstraction of parallel architectures. 
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L. Jentsch and D. Mirschinka 



1 Introduction 

The topic of our investigations are special boundary-transmission prob- 
lems for the equations of mathematical physics, the so-called bimetal prob- 
lems. This is a special class of problems for piece-wise homogeneous bod- 
ies where the interface between the homogeneous parts intersects the outer 
boundary. The following two pictures illustrate the domains which will be 
considered. 





A useful tool for the solution of such kind of problems is the so-called 
Green’s contact tensor or contact function method. These Green’s functions 
were constructed for various problems of elasticity and stationary heat con- 
duction [7-10]. It follows from their properties that the associated boundary 
transmission problem can be reduced to a boundary integral equation over 
the outer boundary, i.e. the transmission conditions on the interface are a 
priori fulfilled. Our main investigations are concerned with the Fredholm- 
ness, invertibility, asymptotic behaviour of the solutions and the numerical 
analysis of the resulting boundary integral equations. In the first section the 
plane problems will be shortly discussed and we give some references for 
further informations. The main tool for the investigation of the boundary 
integral operators is the Mellin transform. The remaining parts of this pa- 
per are devoted to the three-dimensional case. We will discuss the analytical 
and numerical treatise of the boundary integral equations for the problem of 
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stationary heat conduction. In contrast to boundary value problems in non- 
smooth domains the theory of three-dimensional transmission problems (or 
interface problems) is rather incomplete. Only recently, Nicaise and Sandig 
studied such problems in a more general context (s. [21], [22]). But their 
results does not cover the case of three-dimensional interface edges. More- 
over, the topics of these two papers are solvability in certain function spaces 
and the asymptotics of the solution, while we are interested in the bound- 
ary integral equation method for the solution of the transmission problem. 
In [25] solvability and asymptotics of general elliptic interface problems in 
an infinite wedge are investigated. While Mellin techniques are successfully 
used in the plane theory the three-dimensional problem seems to be more 
difficult with respect to the analytical and numerical treatment. The double 
layer potential operator for our problem is an integral operator with fixed 
singularities along the edge of the boundary surface. Similar operators were 
considered by Duduchava and Schneider [3] , [26] . In contrast to the smooth 
Ccise the double layer potential operator is no longer compact in the consid- 
ered function spaces. With a local principle the treatise of this operator can 
be reduced to the study of a family of integral operators on the half-plane. 
Such operators were studied by the authors [20], [13] via a harmonic analysis 
approach. We will prove the invertibility of the boundary integral operators in 
a scale of weighted Sobolev spaces extending our earlier results [19]. Besides 
the analytical treatment of the boundary integral equations we will study 
spline approximation methods for the numerical solution of the boundary 
transmission problem. To obtain optimal order convergence results we use an 
appropriate mesh refinement near the interface edge. Similar results for the 
harmonic double layer potential operator on polyhedral boundaries were ob- 
tained by Elschner [5], [6]. The problems of the linear elastostatics are much 
more difficult. Nevertheless, it was shown in [13] that the associated opera- 
tors can be studied with the same approach as in the case of the Laplacian. 
But the question of Predholmness of the boundary integral operators would 
lead to an invertibility problem for a family of 6 x 6-matrices with nonhomo- 
geneous Mellin pseudodifferential operators as entries. 

For the special geometry of a bimetal ball Poisson formulae for boundary 
transmission problems of stationary heat conduction were obtained by one of 
the authors in [15]. 

Together with D. Natroshvili one of the authors considered non-classical in- 
terface and mixed interface problems for anisotropic elaistic bodies in [16], [17]. 
The main tool for the solution of such problems via boundary integral equa- 
tions is the theory of pseudodifferential operators on manifolds without and 
with boundary. The case of stationary elastic oscillations is investigated in 
[18]. 
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2 Two-dimensional Problems 

In the case of elastostatics the following boundary transmission problem will 
be investigated. 

a) Differential equations for the displacement vector u{x) : 

fjtiAu -f (Ai -f )Ui)grad div u = 0 in D+, 

/jLoAu -h (Aq -h /xo)grad div u = 0 in D~ . 

b) Transmission conditions at So : 

{u}'^ - {u}“ =0 at 5o, 

{t (Sa;, n)u}’^ = {t (a*, n)u} at So, 

where T is the traction operator with the Lame constants A^, fii 

dvji 

T {dx, n)u = ^ ^ 5 

and {.}“ denote the traces at Sq from D+ and D_, respectively. 

c) Boundary conditions of the Dirichlet or Neumann type at 5 = 5“^ U 5“ . 

In the sequel we restrict ourselves to the Neumann problem, i.e. the traction 
at the boundary is given. We will apply the indirect boundary integral method 
for the solution of this problem. Then the program for this method is the 
following: 

a) First, we make a single layer ansatz over S with the contact tensor G{x,y) 
in the kernel of the potential 

u{x) = j G{x,y)<p{y)dSy, (2.4) 

s 

which fulfils (2.1)-(2.3) a priori. 

/3) Applying the traction operator to (2.4) and after that the trace operator to 
S we receive a boundary integral equation A(p = f , with a singular matrix 
integral operator A which has additional fixed singularities at the points 
Pi and P_i. 

7 ) Treatise of the operator A : Predholmness and index of A, solvability con- 
ditions for the equation A<p = f . 

(5) Study of the asymptotics of at the corner points P\ and P_i. 
e) Asymptotics of the solution u(x) at Pi and P_i by computing the potential 
(2.4). 



( 2 . 1 ) 

( 2 . 2 ) 

(2.3) 
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A localization procedure reduces this program to the study of two Mellin- 
convolution operators on the interval [0, 1] associated with the two corner 
points P\ and P-\. The properties of the symbol matrices are connected 
with the essential properties of the operator A. If the symbols are regular 
on the line Re z = ^ the operator A is Fredholm in L 2 {S) and the index 
can be determined by the function-theoretic index of the determinants of the 
matrices. The singular exponents of the asymptotics of (p at the corner points 
can be computed via the zeros of the determinants of the symbol matrices 
in the half-plane Re < | (s. [12]). Then the asymptotics of the solution 
u{x) of the boundary transmission problem can be computed from (2.4) (s. 
[14]). In the case of perfect and non-perfect stationary heat conduction this 
program was studied in [10]. 

The investigation of numerical methods for the solution of the boundary 
integral equations will be restricted to the case of stationary heat conduction 
in a polygonally bounded bimetal domain, i.e. we look for a solution of the 
following problem 

Au = 0 in D_|_ U D- (2.5) 

7 +u = j-u at So (2.6) 



ju = g at o , [Z.Q) 

where 7 , 7 +, 7 __ denote the traces at S and at So from D+ and D_, respec- 
tively. For the solution of this problem we consider a double layer potential 

ansatz i /* \ 

_ 1 f dG{x,y) 






v{y) dSy, 



such that the differential equation and the transmission conditions are ful- 
filled. Here G{x,y) is the contact function 

[ - In |x - y| - ^ In |x - y\ x,y £ H+ := {x 2 > 0} 



G{x,y) = 



-ln|x-j/|-^^ — — ^ln|x-j/| x,y £ H- := {x 2 <0} 
A-i- -h A_ 



A+ + A. 



X £ H+ ,y £ H- 



A+ + A- 



ln|x- j/| 



X £ H- ,y £ 



(2.10) 
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with y = (j/i,-j/ 2 ). 

The boundary condition yield the integral equation 
{-I+K){v){x) := I v{y) dSy = g(x), x G S (2.11) 

for the unknown density v. The boundary integral operator K is at the cor- 
ner points of Mellin type. To obtain optimal convergence results we use an 
appropriate mesh refinement near the corner points. The graduation depends 
on the corner angles and the material parameters or more precisely on the 
asymptotics of the solution of the problem at the corner points. On a mesh An 
we consider spline spaces S^{An) consisting of discontinuous piece-wise poly- 
nomials of degree less than d which are constant in the subintervals containing 
the corner points. To obtain a fully discrete scheme we define an approximate 
operator Kn by replacing the integral in X by a quadrature formula. If we 
put Qn for the interpolation operator with respect to the nodal points of the 
quadrature formula we obtain the following approximation scheme: 

Find V G S^{An), such that 



Vn “I" Qn^n'^n — Qn9 • (2.12) 

This is a linear system for the values of Vn at the nodal points of the quadra- 
ture formula. The Nystrom solution results from 

K = -9 + KnVn . (2.13) 

If R is the order of the quadrature formula, the grading parameter q is appro- 
priately chosen and the right-hand side g belongs to a weighted Sobolev space 
(s. [4]) the Nystrom method (2.13) converges with the order of 0((logn/n)^) 
in the Loo-Norm. 

In our experiments we used piece- wise linear splines and the trapezoidal rule, 
i.e. R = 1. The considered domain is a quadrangle with the corner points 
(1, 0), (1.5, — 1), (2, 0), (1.5, 1), the interface between the first and the third 
point and the boundary data is given by the function G(x, y) with y = (2, 2). 
The following tabular contains the Loo-error on U Z)_ between the exact 
solution of the boundary transmission problem and the approximate poten- 
tial KnVn in the interior and 2N is the number of collocation points at each 
side of the quadrangle. 
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0.353147 


3 


4 


0.384615 




9 


0.253569 




9 


0.345776 




9 


0,376587 




24 


0.046240 




24 


0.063054 




24 


0.068673 




40 


0.009005 




40 


0.012280 




40 


0.013374 




79 


0.000861 




79 


0.001175 




79 


0.001279 




156 


0.000362 




156 


0.000494 




156 


0.000538 



3 Three-dimensional Problems 

Let be a three-dimensional domain with piece-wise smooth Lipschitz 
boundary consisting of two parts and a common plane interface 

So (s. the picture above). 

The surfaces 5_ and the curve F are assumed to be smooth but for 
the whole boundary S an edge along F with nonconstant edge angle will be 
allowed. The normal vector n at 5 and So is directed outside from D and 
D_i_, respectively. 

We consider the following boundary-transmission problem of bimetal 
type: 



Au = 0 

V{l+u - 7_u} = ^+7+^ 

. du . du 
ISU = g 



D\So 


(3.1) 


at So 


(3.2) 


at Sq 


(3.3) 


S 


(3.4) 



where A_|_ , A_ , rj are positive reals and j± and 75 denote the trace operators 
from and D_ on 5 q and 5, respectively. The function is a given function 
from a suitable function space. The problem (3.1)-(3.4) will be solved using 
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certain Green’s functions for contacting half-spaces such that the transmis- 
sion conditions are a priori fulfilled. We include the case r] = oo of perfect 
heat contact where (3.2) becomes 



= j-u at So- 

In the latter case the Green’s contact function is given by 



Goo{x,y) = < 



1 



-h 



A+ -A_ 



x-y\ (A+ -h A_) I x-|/ 



1 



-h 



A_- A4 



^-y\ (A+ + A_) I X-?/ 

2A_ 



(A+ -f- A_) I a: - y I 
2A4- 



for X, y € 
for x,y € Mi 
for X G M^ , y G Mi 
for X G Mi , y G M^ 



(3.5) 



I (A+ + A_) I X - y I 

where y denotes the point (yi, y2, -ya), and M^ := {x G M^ : X3 > 0}, Mi := 
M^ \ M^ denote the upper and lower half-space, respectively. For nonperfect 
heat contact, i.e. rj < 00 , the Green’s function was calculated in [10] as 

Gni^^y) — GoQ^x^y) -f K{x^y) , 



with 






1 2A- 

27t A-|- 4- A_ 

1 2A+ 

27t A-}- -f" A_ 
1 2A- 

27t A-f. “h A_ 

1 2A+ 





if x, 2/ G E+ 


7(x,y) 


if X, y G El 


i{x,y) 


if X G E^ , y G El 


•7(x,j/) 


if X G El ,y G E^ 



(3.6) 



and 



dt 



27t A-|- -f~ A— 

I{x,y) = 27ry j^exp[-(|x3| + IvsMJo ~ V\)^ + {^2 - y 2 f) 

” “ (3.7) 

where a — . We note that I{x,y) is the Hankel transform of order 

A-|- A_ 

zero of the function ^ Since this function decreases 

t (X 

exponentially at infinity a fast Hankel transform technique can be applied to 
compute (3.7) approximately [23]. In [10] it was shown that 
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I{x,y) = 



27T 



-f- {X2 - V2Y + (kal + lyal)^ 



+ R{x,y) , (3.8) 



where R{x,y) has only logarithmic singularities at the plane {xs = 0}. 
This implies that the Green’s function admits a representation G{x,y) = 
Go{x,y) -f Ro{x,y) with 



r 1 1 

-h 



Go{x,y) = < 



x-y 

1 



x-y\ 

1 



\x-y\ \x-y\ 

0 

0 



x,y 

x,y 

X £ , y £ 

X £ Ml ,yeR\ 



(3.9) 



and Ro{Xj y) hats only logarithmic singularities. We note that the main part of 
the Green’s function in the case of nonperfect heat contact does not depend 
on the material parameters. Moreover, the interaction between the two half 
spaces is only weak. 

For the solution of our boundary-transmission problem (3.1)-(3.4) we will 
make a double layer potential ansatz over the boundary with the Green’s 
contact function, i.e. we look for a solution in the form 

u(x) = :^ l^^^^v{y)dSy, x£D+UD.. (3.10) 

s ^ 

We note that this ansatz fulfils the transmission conditions at So a priori. 
The jump relation of the double layer potential yields the boundary integral 
equation 

iAv){x) := -v(x) + ^j v(y) dFy = g{x) , x € S. (3.11) 

r ^ 



The integral operator in (3.11) is in generally not a compact operator in L*2(5) 
due to the non-smoothness of S. Therefore, the theory of second kind integral 
equations cannot be applied to the analytical treatment, and the theory of 
collectively compact operators for the numerical analysis does not work. The 
main topic of this paper will be the analytical and numerical treatment of 
equation (3.11). 

We will finish this section with some results concerning existence and 
uniqueness of the solution of the problem (3.1)-(3.4). The following lemma is 
a consequence of the Green’s formula. 
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Lemma 3.1. Let u fulfil (3A)-(3.3) and let G H]^{D±) := {u e 
H^{D±): Au\d± G L2{D±)Y Then u admits the representation 



^ ^ J Grj{x,y)Au{y) dx 

D 









y) 



dujy) dGrijx, y) 



dn,, 



dn„ 



u{y)) dSy, 



Theorem 3.1. Let the solution u of the homogeneous Dirichlet problem fulfil 
the assumptions of the previous lemma. Then u is zero. 

Proof. Applying Green’s formula in D+ and D_, respectively, we obtain 
A+ j Igrad u\‘^dx = -A+ J u-^dS 



D+ 



So 



A_ j Igrad u\‘^dx = J • 



So 

The transmission conditions on Sq imply 



J Igrad ufdx + ^- J Igrad u\‘^dx = -rj ^A+ j dS < 0 

D+ D_ io 



D+ D 

which finishes the proof. 



□ 



Theorem 3.2. Let g G Then forrj = oo and rj < oo the Dirichlet 

problem is uniquely solvable in H^{D) and H^{D+) x respectively. 

Proof Since the uniqueness follows from the previous theorem we only have 
to show the existence of a solution. For the case of perfect heat contact let 
us consider the form 

w) := A+ J grad v grad wdx + j grad v grad wdx 



D_ 



with v,w £ H^{D). Then the estimate 



$i{v,v) > c||u||^ 

holds for V G Hq(D), i.e. is elliptic on Hq{D). Hence, the variational 
problem 

(u, u) = (/, v) for all u G {D) 
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is uniquely solvable in Hq{D). Let g € H^{D) be an extension of g into D 
and let w be the solution of 



^i{w,v) = ~^i{g,v) for all v G 



Then the function u := w g is a, solution of the problem (3.1)-(3.4). 

In the case of nonperfect heat contact the variational formulation is the fol- 
lowing: Find u £V such that 

^i(u,v) + j ^ ~ ^ ^ ^ 0 , (3.12) 

So 

where V C x consists of functions which fulfils the bound- 

ary condition and the transmission condition (3.2) on Sq and Vq is the sub- 
space of V with zero boundary data. The coercivity of the form (3.12) is 
obvious. To show the boundedness we have to consider the integral over 5 q. 
With the transmission condition we obtain 



5o 






dv 

7+u-7_u,74- — 



L2(So) 



and since the trace of the function and of its normal derivative is a con- 
tinuous map from the //^-spaces to and i/”^/^(5o), respectively, 

the last quantity is bounded up to a constant by the product of the 
X H^{D-)-noTms of u and v. The construction of the solution is 
now the same as in the case of perfect heat contact which finishes the proof. 
□ 



4 Boundary integral operators 

This section is devoted to the study of the boundary integral operators as- 
sociated with the problem (3.1)-(3.4). First, we will introduce the single and 
double layer potential with the Green’s contact function 

{Vw){x) = -^ j Gr,(x,y) w{y) dSy , x^S (4.1) 

5 

(W^«;)(x) = T I w{y) dSy , x^ S. (4.2) 

The trace of F on 5 will be denoted by Vb, i.e. (Vow)(x) := (jsV)ix), x £ S. 
For the double layer and the normal derivative of the single layer potential 
the well-known jump relations hold 
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I (x) = ±w{x) 4- {Knw){x) , X £ S (4.3) 

V ons ) 

( 75 ^^) (^) ~ Tw{x) 4- (Kdw){x) , X € S', (4.4) 

with the direct values Kd and i^iv of the double layer potential and the 
normal derivative of the single layer potential, respectively. 

For A (A+,A_) with A+ > 0, A_ > 0 we define an equivalent inner 
product on L 2 {S) by 

(u,v)x '= A+ / uvdS J uvdS . 

5 + S- 

Then the following theorem holds. 

Theorem 4.1. The operator Vo is an isomorphism between and 

Moreover, it is selfadjoint with respect to (.,.) a ; « 

{Vou,v)x = {u,Vov)x Vu,t; e 

Proof. The selfadjointness with respect to the modified duality product is 
obvious. To prove the invertibility we will show that the operator Vb is positive 
definite. We restrict ourselves to the case rj < 00 (The proof for the perfect 
contact is similar [19].). Consider the forms 

grad u grad wdx T A_ / grad u grad wdx 

D+ D_ 

^a{u,w) := A+ / grad u grad wdx -h A_ j grad u grad wdx 

rIXdT 

for u, to belonging to a subspace of x such that the interface 

condition (3.2) is fulfilled on the plane {xs = 0}. Then we have obviously (s. 
section 3.) 

{X3 = 0> 

(4.5) 

We put ^{w,w) for the left-hand side of (4.5). Let h%u\—Vv. The properties 
of the Green’s function imply that u belongs to the space ) x ) 

and fulfils the interface conditions. Furthermore, the gradient of u is in 
L 2 {R^). Green’s formula together with the jump relation for the normal 
derivative of the single layer potential yields now 



$i{u,w) := A+ j 



^{u,u) = {VoV,v)x- 
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Since the trace map 7^ is continuous from (M^ ) x (E ?_ ) to H 
we have 

lbllH-l/2(5) < C ^||u||/^l(R^)xHl(R^)j , 

which together with (4.5) implies the desired estimate. □ 

It follows from the previous theorem that Vb generates an equivalent norm 

Theorem 4.2. The operator I + Kn is a Fredholm operator with index zero 
in and the a priori estimate 



{VqV, (/ + Kn)v)x > Co||w||_i/2 - Cl 



2 



j VovdS 

S 



holds. 



Proof. The previous theorem and the continuity of the trace map imply 

11^11^-1/2(5) - ||Vbt/||^l/2(5) < Cr2\\Vv\\‘fji(^i^y 

For the last quantity we obtain with Poincare’s inequality 






I 



\gTdid{V v)\^ dx 4- 



JvovdSl 

s 



(4.6) 



We consider the form ^i(VVjVv). Green’s formula applied in and D_ 
yields 

^i{Vv,Vv) = {Vov, (/ + Km)v)x . (4.7) 

Since ^i{Vv,Vv) and f |grad(yv) are equivalent we get the estimate of 
D 

the theorem. It follows now that 7 + is a semi-Fredholm operator with 
finite dimensional kernel in Using the associated version of (4.7) 

for the outer domain we obtain that — 7 -f- TCjv is semi- Fredholm. Obviously, 
for any a € C, |a| > 1 the estimate 



\{yov, {al -h Kn)v)x\ > co\\v\\-\/2 - ci 



2 




s 



holds. But al + Kn is invertible for sufficiently large a which implies the 
assertion of the theorem. □ 



Theorem 4.3. The operator — 7-h Kd is invertible in 
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Proof. The operator Kd is the adjoint operator of Kn with respect to the 
inner product This together with the previous theorem implies that 

the operator -I-\-Kd is Fredholm with index zero in the space 
Since the kernel of -I-\-Kd is trivial due to the uniqueness of the Dirichlet 
problem the proof is finished. □ 

It follows from the previous theorem that the operator —I+Kdis strongly 
elliptic in with respect to the dual norm of |.|vb- This implies the 

stability of any Galerkin method with an appropriate space Xn of ansatz 
functions, but with test functions of the form ko(Xn). Since this is not very 
applicable we will study the double layer potential in L^-spaces on the bound- 
ary. While the previous considerations were motivated by the boundary value 
problem, basically, the following approach is based on the theory of singular 
integral operators. Thus, the techniques used now are local principles and 
convolution type operators. 

We will consider the indirect boundary integral method using the double 
layer ansatz (3.10) for the Dirichlet problem (3.1)-(3.4). The jump relation 
for the double layer potential yields the integral equation over the outer 
boundary in L 2 {S) 

- v(x) + -^ j = six) ■ ( 4 - 8 ) 

5 

The assumptions on the boundary 5 imply the following theorem. 

Theorem 4.4. Let he x £ S\F. Then the operator Kd is locally compact 
(s. [23]) in X. 

Proof. The theorem is a consequence of the local compactness of the usual 
harmonic potential operators at smoothness points of the surface. □ 

Let be Xo G r fixed. We put a = a{xo) and (3 = f3{xo) for the angles 
between the tangential half-planes at xq and the plane 5 q. Let f/ C 5 be a 
neighbourhood of xq consisting of two parts = S^DU and U- = S- DU 
such that and U- are diffeomorph to a neighbourhood of zero in the 

half-plane and the image of xq under booth mappings is the point zero. 

We denote by r+ and r_ two such diffeomorphisms and by %+ and x- the 
characteristic function of I/_|. and I/_, respectively. Then the restriction of 
the operator on U has the following matrix representation: 

f -I + X+KdX+ X+KdX- ^ 

V X-K^dX+ -J^ + X-t<^DX- ) 

where / is the identity at and [/_, respectively. Composing this matrix 
operator from the left with 



and from the right with 



0 



0 

r_ 
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0 




we obtain an operator Ad up to now defined only on a zero neighbourhood 
of the half-plane . In the terminology of the local principle of Simonenko 
[23] the operators —I -h Kd and Ad are xq — 0-quasi-equivalent. Extending 
the operator Ad on the whole half-plane by zero this assertion remains true. 
Using the arguments of [1], pp. 209-211, one can show that the operator Ad 
is 0-equivalent to the following operator Ad 



Ad 






~ G2a 0 
0 -IAG20 



in the case rj < 00 and 



/ 



Ad = 






A-)- + A— 
2A+ 



Gia ^ 



A-l -i- A_ 



V A+ -f A_ 



G ^+0 -/+^+ 



Go 



for 7^ = 00 , where 

iG^u)ix^,X2) :=- f ^Sin<j>u(yuy2)dy^dy, 

A [(a;i -yiY +x\- 2x23/2 cos 4> + 



R2 



(4.9) 



(4.10) 



In [20] the operator was studied in weighted L^-spaces on the half-plane 
E^. In particular, it was shown that 







Moreover, the function (j) 1 — > is continuous in the operator norm topology. 

This yields the following two theorems. 

Theorem 4.5. Let be rj < 00 . Then the boundary integral operator for the 
Dirichlet problem, —IaKd, is a Fredholm operator with index zero in 1/2(5). 
The essential norm of the operator Kd is smaller than one. 

Proof. For all a; G T the operator —I + Kdis quasi-equivalent to invertible 
2 X 2-matrix operators on the half-plane. This remains true for the operator 
—I A uKd with u G [0, 1]. Since the index is homotopy-invariant we obtain 
that —I A Kd has zero index. □ 



Theorem 4.6. Let be t) = 00 . The boundary integral operator —I A Kd is a 
Fredholm operator with index zero in L 2 {S) if one of the following conditions 
is fulfilled: 
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(i) 



\ 1 — k ^ . / XM 2^/k 

^ _|e„sWx))| + — 



COS 



a{x) + ^{x) 





(a( \\\ . fa{x)+l3{x)\ 

l + tlc“(«*))l+i + /i <=“( 2 j 


|<1, (4.11) 


where A+ 


= kX-, k G (0, 1], 




(ii) 


a{x) = l3(x) , for all x e F , 


(4.12) 


(Hi) 


a{x) -f l3{x) = 7T for all x £ F . 


(4.13) 



The proof is the same as for the previous theorem using the results on the 
invertibility of the local operators on the infinite wedge (s. [20]). 

The assertion of the theorem remains valid if we replace the L‘2(5)-space 
by any Lp^^(5)-space with p> 2 and 0 < /> < 1/p, where 

Lp^p(s):={u£ dist{x, ryipis)} 

and dist(x, F) denotes the distance of x E 5 to the edge F. 

We note that the existence theorem of the previous section is not ap- 
plicable since an L 2 -density in the double layer potential does not produce 
a -solution of the boundary value problem. Nevertheless, the following 
uniqueness result holds. 



Theorem 4.7. The operator — /H- Kd is invertible in L 2 {S). 

Proof, Since the operator is Fredholm with zero index we have only 

to show that its adjoint has a trivial kernel. Let / € L‘ 2 {S) be a nontrivial 
function with (— / -h = 0. Then the function 



v(x) ^ / G{y,x)f{y)dSy 



is a solution of the following outer Neumann problem 

Av = Q in (M^ \ D+) U (ll \ D_) (4.14) 



x_i 1 dv 

tj(Ai 7 +d - Aq 7 _v| = -7+:^ at 5i 



dv 



dxs 

dv 



dxz dxz 



dv 



1-^ = 0 
dn 



at 5i 
at S, 



(4.15) 

(4.16) 

(4.17) 
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with 5i = {xs = 0} \ So- It follows from the mapping properties of the 
single layer potential that the function v belongs to the space in the 

subdomains \ D+ and Ei \D-. We consider a ball Br with sufficiently 
large radius R and apply Green’s formula to the domains 

B+ := (Rl\D+)nBR 

and 

:= (El \ 5_) n . 

Then for the gradient of v we get 



Igrad 



aBHOR®. aBsoRi 



,-i f dv ^-l dv 



vdS H- 



S'iDBh SiC\Bji 

The first two integrals are asymptotically 0{R~^). The third and fourth 
integral vanish due to the homogeneous boundary condition. For the sum of 
the remaining two integrals we obtain 

S\r\Bn S\r\Bji 

= / hfel 

S\nBji 

whence v = const and since v{x) -> 0 for x -)> oo, ?; vanishes identically 
in the outer domain. The continuity of the single layer potential through S 
implies that v vanishes in the interior domain, too, i.e. it is zero everywhere 
which yields that / vanishes at S. This contradicts our assumption and the 
proof is finished. □ 




Bimetal Problems 



93 



We note that the conditions (4. 11), (4. 13) imply that the essential norm of 
the operator Xd is smaller than one if the space L 2 {S) is endowed with the 
modified norm generated by the inner product (.,.)a- Under the condition 
(4.12) this in generally not true. 

For the numerical treatment of equation (4.8) we need some additional 
weighted Sobolev spaces. For A: G N and p G [0, 1/2) we define the spaces 



X^{Rl) := {u e x^L 2(K^) : G L2(K+), 1 < |a| < A;} 

r;(E^) := {u € ■ 4~"Di,Di^u 6 HRl), l<i + j<k] 

on the half-plane M^. The space Xp(Mi^) (= Fp^(M^) will be denoted by 
L 2 ,p(M^). The associated Sobolev spaces on the surface S are defined either 
by choosing a partition of unity and a family of diffeomorphisms or directly 
in the following way. Let dist (x, F) be the geodetic distance between a point 
at S and the edge F. Then, the definition of the spaces Xj^{S) and ¥^{8) 
is the same as above if we replace the weight function X 2 by dist (x,JT) and 
the differential operators by an appropriate system of tangential differential 
operators. 

For the infinite wedge problem the following regularity result holds. 

Theorem 4.8. Let the operator Ad be invertible in L 2 ^^(E^) := L‘ 2 ,p(M^) x 
L2,p(M^). Then it is invertible in (X^)^(E^) and (F^^)‘‘^(E^) for any fc G N. 

Proof. We will show that the operator maps the space L 2 ,p(Ei^) (= 
X^{R\)) into X^(E^) for any A: G N. Let be A: = 1. We have to show 
that the functions X 2 Dx^G(f^v and X 2 Dx^G(f^v belong to L 2 ,p(E^^). The kernel 
functions of these two integral operators are linear combinations of functions 
of the form 



k{xi -y\,X2,y2) - 



(a^i - yi)'‘x^2yT 

[(xi — J/i)^ + x| — 2 x 23/2 COS^ + y|](”»+*+*+2)/2 



with natural numbers m,k,l. Theorem 2 in [13] gives the boundedness of such 
operators in L 2 ,p(E^). The cases A: = 2,3, ... can be done in the same way. 
The mapping property together with the invertibility in L| ^(E^) yields now 
the invertibility in the X^-spaces. To prove the second part of the theorem we 
first note that the operator commutes with the operator of differentiation 
with respect to the xi -variable. This is due to the convolution structure of 
the operator G<^ with respect to the first variable. We denote by u the partial 
Fourier transform of u with respect to x\ (the co- variable of a:i is denoted 
by ^). Then the invertibility of Ad is equivalent to the invertibility of the 
operator function Ad(0 ■— Xxi-^^Ad- Let us consider the equations 






(4.18) 
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where / G For / = 0 (4.18) has a unique solution v G 

Since G .F ((L 2 ,p)^(M^)), we obtain G .F ((L 2 ,p)^(M^)) for I > 1. 
The proof of the first part of the theorem shows now that belongs to 

T ((F^^)^(E^)) which finishes the proof. □ 



5 Meshes and Spline spaces 

It is well-known that the solution of the boundary integral equation has a 
singular behavior at the edge even if the right-hand side is arbitrary smooth. 
This implies that the convergence of standard spline approximation methods 
is only of low order. To obtain similar results ats for the approximate solution 
at smooth boundaries we have to adapt the method to the asymptotics of the 
solution. This can be done either by adding some special singular functions 
to the space of trial functions or by a special mesh grading. While the first 
method does not seem to be very applicable in the three-dimensional situa- 
tion, since the exact asymptotic behavior of the solution has to be computed, 
the mesh grading is often used in the literature (s. [6], [24]). 

For q> 1 we define a sequence n G N, of graded meshes on the boundary 
S. We first cut off two strips F+, F_ of width one on both sides of the edge F. 
Taking closed curves with distance {i/ny, ^ = 1? n — 1, to F on both strips, 
we obtain a graded partition of the boundary near the edge. Furthermore, 
we choose a uniform partition of F with mesh-size of the order 1/n. Taking 
curves from these mesh points in the normal direction to F, we get a quad- 
rangulation of the strips , F_ . Dividing each quadrangle into two triangles 
by non-intersecting diagonals and choosing a regular family of partitions into 
triangles of diameter of the order of 1/n for the remaining two parts of the 
boundary, we finally obtain our graded mesh An- We assume that for any 
triangle G An there is a smooth bisection tt^ on a reference triangle fi in 
the plane. 

For a sequence of triangulations An we define the spline spaces Sd{An) con- 
sisting of the pre-images of all piece-wise polynomials of total degree not 
greater than d subordinate to An. Let Pn denotes the orthogonal projection 
of L 2 {S) onto Sd{An). We note that the sequence {-Pn} converges strongly 
to the identity. 

The Galerkin method for the approximate solution of equation (4.8) is the 
following: Find Un € Sd{An) such that 

PnAUn = Pn9 • (5.1) 

The stability of the method (5.1) is closely connected to the stability of 
the Galerkin method for the approximate solution of the boundary integral 
equation for the infinite wedge problem. Therefore, we define a associated 
triangulation of the half-plane . On the strip F = {x : 0 < X 2 < 1} we 
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take parallels to the xi-axes at the distance (i/n)«, i = \...n. The remaining 
part of the half-plane is divided by parallels with the uniform distance 1/n. 
Taking half-lines in the normal direction to the xi-axes with distance 1/n we 
finally get a rectangular grid with an anisotropic mesh grading with respect 
to the boundary of the half-plane. Lifting that partition to the boundary of 
the infinite wedge and dividing each rectangle by nonintersecting diagonals 
we finally obtain the grid An- The spline spaces will be defined as for the 
bounded surface. The associated Galerkin projection will be denoted by P„. 
We note that the Galerkin method 

PnAun = Pn9 (5.2) 

in the case of the infinite wedge has an infinitedimensional system matrix. 
Therefore, for practical computations a finite section method should be ap- 
plied at first. 

We finish this section with a well-known result about the approximation 
of functions in Sobolev spaces by polynomials (s. [27]). Let r be a triangle in 
the plane. We put 

h* =sup{|xfc : x, 2 /er}, A; = 1,2, 
and for any multi-index a = ( 01 , 02 ) we set h°“ 

Proposition 5.1. Letd&N and f £ Then there exists a polynomial 

p of total degree less than d such that 

ll/-p|U,(r)<C^ h“||£)“/llMr) (5.3) 

\a\=(i 



6 stability and error estimates for the Galerkin 
method 

This section is devoted to the L 2 -stability of the Galerkin method (5.1) and 
the associated method on the infinite wedge. For rj < oo the essential norm 
of the double layer potential operator is less than one which implies the 
L 2 -stability of any Galerkin method. This remains true for 7^ = oo if the con- 
ditions (4.11) or (4.13) are fulfilled. Under the condition (4.12) the operator 
is not strongly elliptic, in generally. We consider the Galerkin method for the 
infinite wedge with a = /?. 

Theorem 6.1. Let be a = (3, then the Galerkin method (5.2) is stable. 

Proof. We have to show that the sequence {Ai^d Ai} is stable, where Ad is 
given by (4.10) with a — (3. Since all entries of the matrix operators in this 
sequence commute with each other, the determinant is well defined. So the 
stability is equivalent to the stability of 
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{ det {PnADpn)} = [PnlPfi “ PnG 2 a PnG 2 a Pn} 

which is obvious since the operators of this sequence are even invertible on 
the image of Pn- D 

From the previous theorem follows now the stability of the Galerkin 
method (5.1). 

Theorem 6.2. Let be rj < oo or let be rj = oo and the invertibility conditions 
(4‘^^) or (4-^3) are fulfilled. Then the Galerkin method (5.1) is stable 
in L 2 {S). 

It follows in a standard way from the stability theorem that the Galerkin 
error is equivalent to the error of the best polynomial approximation, i.e. 



11^ - ^n||L 2 ( 5 ) < C||(/ - Pn)u\\L 2 {S) for all U > Uq. (6.1) 

This reduces the question of the convergence rate to a question of approxi- 
mation. 



Theorem 6.3. Let the assumptions of Theorem 6.2 be fulfilled. If g £ 
y^rf-fi(^)^ 0 < p < 1/2, and the grading parameter fulfils q > (d -f l)/p, 
then the Galerkin method (5.1) converges with the error estimate 

11^ - UnWi^iS) = 0(n"^"^), n -> 00 . (6.2) 

Proof. Under the assumptions of the theorem the solution u belongs to 
the space We have to show that the best L 2 -approximation of an 

element u € Yp^^{S) by splines from S^{An) is of order if the grading 

parameter satisfies q > {d-\- l)/p- It suffices to prove that for any triangle r 
of the mesh there holds 

||(/-Pn)w|U,(r) Y. \\dist^~^ix,r)D\Diu\\L,^r) (6.3) 

i-\-j<d-hl 



where D\ and D 2 denote locally acting tangential differential operators in the 
edge direction and normal to the edge, respectively. First, let r be a triangle 
from the strip of unit width around the edge such that dist(r, P) = 0. It 
follows from the uniform boundedness of the projections Pn that 



IK^-'Pn)wllL(r) <c||«||i,(^) 



< J dist^'’(x,r) |dist^(x, r)w|^d5 



'^2,p(r) 



which gives the desired order since qp> d + 1. 

Let now r be a triangle from this strip with positive distance to the edge. 
It follows from proposition 5.1 that 
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\\{I-Pn)u\\L,ir)<c\\u\\ h\hi\\D\Diuh,^,) 

i+j=d+l 

<c ^ /i*i/i^dist'’“^(r,r)l|dist^"^(x,r)D5D^u||x,^(^) . 

i+j=d+l 



By the definition of the grid , we have h\ ~ 1/n* and there is an index k such 



that 



1 



kyip-i) 1 

- < c— 

n J 



which implies (6.3). 

If r belongs to the remaining parts of the boundary we apply the proposition 
5.1 again and obtain 



|a|=rf+l 



We note that the partition is uniform and the norms of the spaces Yp{T) and 
are equivalent to each other. Combining all estimates we obtain the 
convergence result of the theorem. □ 
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Analysis of 3D Elastoplaistic Notch and Crack 
Problems using Boundary Element Method 

G. Kuhn and P. Partheyrniiller 



1 Introduction 

It is a permanent challenge for designers and engineers in mechanical en- 
gineering to design technical structures appropriate to their future use and 
to correctly assess how safe they are against failure. In order to solve this 
problem, basically three points must be considered: 

- How can the behaviour of a given material be described ? 

- Which mechanisms of failure does this material show, and which are the 
corresponding quantitative criteria of failure ? 

- How can stresses and strains at every position of the structure be deter- 
mined effectively, taking into consideration the known material behaviour 
and the external load, and how can the characteristics for the underlying 
failure be obtained ? 

Due to the complex geometries and the loads imposed on them, evaluation 
of stresses and strains is only possible using numerical methods, even if the 
simplest material behaviour - linear elasticity - is considered. In practice, 
mainly the Finite Element Method (FEM), the Boundary Element Method 
(BEM) and the Finite Differences Method (FDM) are applied. 

For linear elastic material behaviour, BEM has meanwhile developed into 
a real alternative to FEM. Owing to transformation of the problem to the 
boundary, time required for pre- and post-processing of the computational 
models, especially for three-dimensional problems, is reduced considerably. 
Furthermore, experience has shown that high gradients of field quantities, 
occurring for example in case of stress concentration, can be approximated 
very exactly. 

If BEM is applied in the framework of time-independent small strain 
elastoplasticity, the advantage of discretization in BEM in comparison to the 
volume-orientated methods FEM and FDM is largely preserved for problems 
of local plasticity, which are of great importance in mechanical engineering 
(for example notch problems). 

Apart from the standard proof of strength, it may be necessary for safety- 
relevant structures in mechanical engineering to do an assessment in terms 
of fracture mechanics, especially when cracks occur due to manufacturing or 
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during operation. Since a crack is the most stringent form of stress concen- 
tration, BEM has proved to be a powerful method especially in linear elastic 
fracture mechanics (LEFM), but also in nonlinear fracture mechanics. 

At the beginning of the project the Boundary Element Method provided 
promising results in two-dimensional linear as well as nonlinear continuum 
and fracture mechanics [39, 31, 38] and for linear elastic applications also 
in 3D [26]. Thus the main aim of this project was to apply BEM to three- 
dimensional elastoplcistic notch and crack problems and to realize it in terms 
of a computer code, because efficient numerical methods are of great practical 
interest especially for those problems. During the last period of granting 
this project, the aim was extended to investigate the potential of Boundary 
Element Method for numerical simulation of fatigue crack growth in 3D. For 
definite decoupling of crack surfaces, the formerly used subregion technique 
was replaced by a more efficient method, the DUAL-BEM [36]. 



2 Basics of Continuum Mechanics 

A three-dimensional, homogeneous, isotropic body (domain i?, boundary F) 
with linear elastic or inelastic material behaviour is considered. It is loaded 
as shown in Fig. 2.1 in i? with body forces bj, on Ft with tractions tj and on 
Fu with displacements uj. 




2.1 Elasticity Theory 

The theory of linear elasticity is based on the following equations: 

equilibrium conditions : 4- = 0 , aij = aji , tj = ajiUi 

kinematic relation : £ij — ^(uij 4* Uj^i) 



constitutive relation : 



(Tij — 2G ^ij "I" i—2u j ^ijkl ^kl • 

( 2 . 1 ) 
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Cijki is the elasticity tensor with the material constants shear modulus G, and 
Poisson^s ratio i/, rii is the outward directed normal vector of the boundary P. 
The usual elimination of stresses dij and strains Sij leads to the well-known 
Lame-Navier^s equation 

"h I — 2iy ~ ~ ~G ^ 

which, in combination with the boundary conditions uj oni^ and ij on Ft, 
completely describe the elasticity problem. 



2.2 Plasticity Theory 



Time-independent, elastoplastic material behaviour in the framework of small 
strain plasticity is assumed. Here, the total strain tensor 

^ij = ^ij ( 2 - 3 ) 

can be split up additively into an elastic and a plastic component. Analo- 
gously to elasticity (see equations (2.1)), the equilibrium conditions and the 
kinematic relation as well as Hooke’s law for the elastic component of the 
strain tensor are valid: 



'tj 



2G 






Cijki ^i) 



(2.4) 

If again stresses and strains are eliminated, one obtains a generalized form of 
Lame-Navier’s equation 



1 hj 

+ 1 _ 2i/ “ G 


with 


Uj 

^3 


rr 


O O 


(2.5) 


and the pseudo body force vector 












h = bj - 2G 


. -k 


V 






(2.6) 


l-2i/ 


5 


which contains the influence of the nonlinear material behaviour. 





For complete description of the problem, additionally the constitutive 
equations for time-independent elastoplastic material behaviour must be pro- 
vided. For this purpose, the internal variables ^ = (^i,$ 2 , •••^n) are intro- 
duced. Together with the variables stress and temperature, they describe the 
current state of the material. For the internal variables, suitable evolution 
equations are necessary and must be available. 

For time-independent, incremental plasticity, the flow law usually consists 



of 
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- the yield criterion: 



F{a,T,^) = 0 , 



- the flow rule: (2-7) 

- the consistency condition: cZA F = 0 , cZA > 0 mit F < 0 , 

written in Kuhn-Tucker form. F denotes the yield function, d\ denotes the 
plastic proportionality factor, and Q denotes the plastic potential. In case of 
associated plasticity, the yield function is considered as the plastic potential. 

The classic yield functions (Tresca, von Mises, Drucker-Prager, Mohr— 
Coulomb) were applied both for kinematic and isotropic hardening by Dallner 
in his PhD-thesis [9]. In this thesis, the phenomenological and physical basics 
as well as the respective literature are given. 



3 Basics of Fracture Mechanics 

3.1 Linear Elastic Fracture Mechanics 



In fracture mechanics, there are three basic modes of crack opening - Mode 
I, Mode II and Mode III. Every mode creates an independent asymptotic 
crack near-field in an elastic body with a crack. Superposition of all three 
modes leads to the general three-dimensional crack near-field. In the crack 
front coordinate system (see Fig. 3.1), the components of the asymptotic 
displacement field 



Ui 






and of the asymptotic stress field 



(Tij — 






y/2iTr * 



y/2'Kr 






(3.1) 

(3.2) 



can be described in dependence on a local polar coordinate system (r, cp) [31]. 
For r -> 0 the asymptotic stress field shows a l/y/r- singularity. Its intensity 
is described by the stress intensity factors (K-values) Kj^Kn and Km for 
the three modes of crack opening. An exception to this are those points of 
the crack front located on the free surface of the cracked body. Here, an edge 
singularity is superposed. 

If under single-mode or mixed-mode load the K-value or a representative 
K- value = Kv(F/, K//, fC///) exceeds a critical material-specific value 

{fracture toughness)^ crack propagation becomes instable. Fracture criteria 
are then 



Ki > Kic for i = /,//,/// and Ky > Kyc {Kjc, Knc, Kuic)- (3.3) 
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In the following, the path independent J-integral (Rice) is investigated. 
In LEFM it corresponds to the energy release rate (Griffith, Irwin), but can 
be also extended to nonlinear material behaviour in the framework of defor- 
mation theory. Within the LEFM for 3D problems, the J-integral on a point 
s of the crack front can be written as [33]: 

Jx^is) = J {Wni - aijrijUi^i)dr - j {a A . (3.4) 

r A(r) 

Here, nj denotes the outward directed normal vector to the contour F copla- 
nar with A{F) (see Fig. 3.1). W denotes the elastic strain energy density. 
While in 2D, the J-integral can be described by the contour integral alone, in 
3D additionally a surface integral must be considered. This surface integral 
includes both the singularity on the crack front and the boundary values on 
the crack surfaces. 




Fig. 3.1. 3D crack problem: contours and area configuration for J-integral 



Within the crack near-field, the J-integral can be decomposed into the 
three modes of crack opening (J^^ = Jj -j- Jjj 4- Jjjj) [28], [34]. Then the 
K-values 

Ki = Kn = Kill = ^/2Gli7I (3.5) 

YAC-fl VAC-hl 

can be determined. G denotes the shear modulus, and ac = 3 — 4z/ stands for 
the plain strain state. 

Within the crack near-field, not only the J-integral itself but also the three 
modes J/, J/j, Jj// are path independent. In the far-field, path independence 
can be proved only separately for the symmetric and the antisymmetric part 
of the J-integral [20]. 
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3.2 Nonlinear Fracture Mechanics 



In the framework of the deformation theory, due to Carpenter et al. [7] one 
obtains the J-integral 

J{s) = hwni-(TijnjUi^i)dr- j ((TisWi.i ),3ci^+ / 

^ Mr) A(n 

(3.6) 

for 3D crack problems, which is also path independent. The strain energy 
density {W = 4- W^) consists of an elastic and a plastic component. 

Following Hutchinson [21] and Rice and Rosengren [37], in analogy to 
the linear elastic case singular crack near-field equations (HRR field) can be 
developed by application of the deformation theory. For material hardening 
in the form of a Power Law Strain Hardening stress-strain relation due to 
Ramberg-Osgood 




and on condition that near the crack front the elastic strains compared to 
the plastic strains can be neglected, the order of singular behaviour and the 
respective angle distribution depend on the hardening exponent N. In the 
crack front coordinate system (see Fig. 3.1) the HRR field can be written as 



aaoSolNi" 



dij{ip,N) 



= (To 



(3.8) 

(3.9) 



where In is dependent on N and is a known scaling factor. The J-integral 
describes the intensity of the singular fields. 



4 Boundary Element Method in Elastoplasticity 

For the sake of simplicity, body forces are neglected in the boundary in- 
tegral equations for elcistoplastic material behaviour. Furthermore, a rate- 
formulation is not necessary, because in correspondence to Section 2.2 time- 
independent small-strain plasticity is assumed. 

4.1 Boundary Integral Formulations 

In analogy to elasticity, for elastoplastic material behaviour the principle of 
weighted residuals can be applied. Thus, the generalized Somigliana identity 
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MO = j Uiji^,x)tj{x)dr - j Tij{^,x)uj{x)dr + j Sijk{Ox)£^jki^)dn 

r r Q 

(4.1) 

can be derived. Uij, Tij and Uijk are the well-known Kelvin fundamental 
solutions. The domain integral, which arises in addition to the linear elastic 
case, describes the influence of nonlinear material behaviour and is weakly 
singular. For this reason, only the strongly singular boundary integral with 
the kernel Tij must be considered for the limiting procedure. Thus, one ob- 
tains the elastoplastic displacement boundary integral equation 

CijiOujiO = jUijiOMjiOdr-J^ Tij{i,x)uj{x)dr+ j Eijk{i,x)e^j^{x)dn 

r r n 

(4.2) 

for source points on the boundary [1, 6, 9]. 

Analogously to elasticity an integral equation can be given also for dis- 
placement derivatives on source points inside the domain, if the generalized 
Somigliana identity (4.1) is differentiated with respect to the source point 
coordinate Due to the strong singularity of the additional domain integral, 
a thorough limiting procedure and a regularization has to be carried out. 
Here, one distinguishes indirect^ approximative direct and exact direct meth- 
ods of calculation, for which detailed literature studies can be found in the 
PhD-thesis of Dallner [9]. 




Fig. 4.1. Source point ^ located inside the domain 



A method suggested by Dallner, Kuhn [10] provides the formulation 



duiiO 

d^i 



= j - j ^Uj{x)dr + j ^^e^ji^{x)dnR 

r r f}R 

+ j - ^kiO^dfis - £%i0 j ^ijkni{^)dEs (4.3) 

rs 



for the displacement derivatives on source points inside the domain, if the 
limiting procedure e 0 (see Fig. 4.1), the Leibniz formula, a regularization 
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via Taylor series expansions and the Gauss theorem are taken into consider- 
ation. 

The advantage of this formulation is that the strongly singular domain 
integral is replaced by regular and weakly singular domain integrals as well 
as regular boundary integrals over Fg. These integrals can be evaluated both 
in 2D and in 3D using standard numerical integration schemes. If, in view 
of discretization of the domain, all cells which contain the source point ^ are 
summed up to Hg, the displacement derivatives on source points inside the 
domain can be determined efficiently for any cell types. 

An integral formulation for the displacement derivatives can be derived 
also for source points on the boundary. Due to the hypersingular boundary 
integrals, this had been a serious problem for a long time. After the hyper sin- 
gular integrals in the framework of linear ela^sticity had been solved [14, 19], 
Leitao et al. [29] for the first time presented a h 5 rpersingular formulation for 
elastoplastic material behaviour in 2D. The strongly singular domain inte- 
gral is evaluated directly by using the method of Guiggiani [14]. Parallel to 
this, Huber, Dallner, Partheyrniiller, Kuhn [18] extended the above derived 
formulation (4.3) to source points on the boundary and applied it to 3D 
elastoplastic problems. The advantage of this approach is that direct evalu- 
ation of the strongly singular domain integral, which is especially costly for 
three-dimensional problems, can be avoided. 

Starting from a formulation of the generalized Somigliana identity (4.1) 
where the source point is excluded from the domain (see Fig. 4.2), the limiting 
procedure e ^ 0 and the derivation to the source point coordinate ^ can be 
done separately for the boundary and the domain integral: 




Fig. 4.2. Source point ^ located on the boundary 



The first limit of equation (4.4) is the initial formulation for the derivation 
of the displacement derivatives at source points located on the boundary for 
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linear elastic material behaviour. For 3D problems, this was first applied for 
smooth boundaries by Huber, Lang, Kuhn [19]. Evaluation of the hypersingu- 
lar and strongly singular integrals is based on the Guiggiani [14] method. For 
this, CL^-continuity is required for the displacements at the source point. 

The limiting procedure for the domain integral in equation (4.4) can be 
done analogous to the source points inside the domain. When both parts are 
summed, the hypersingular formulation of the displacement derivatives for 
elastoplastic material behaviour can be rewritten as 



1 duj(C 

2 d^i 




d^i 



Uj(x) 



dr - 



(4.5) 



+ / + / ^ dOs 

On Os 




^ijuniix) dFsi 



4- lim 



r 






rsa-r. 



for a smooth boundary [18]. 

The techniques applied for numerical evaluation of all integrals occurring 
in the boundary integral equations for elastoplastic material behaviour are 
described in the report [25]. More details on this topic can be found in the 
articles [2, 10, 14, 18, 19, 35]. 



4.2 Elastoplastic Solution 

For numerical solution of the elastoplastic problem, both the geometry 
(boundary T, domain i?) and the relevant field quantities (displacements 
Uj, tractions tj, plastic strains must be discretized and approximated. 
This is done via discrete nodal values and interpolating shape functions. 

Although the domain must be meshed, the advantage of discretization for 
problems with local plasticity (notch and crack problems) in comparison to 
volume-oriented methods (FEM, FDM) is preserved to a large extent. Only 
those areas where plastification occurs must be considered. 

Using a collocation method^ the discretized form of the displacement 
boundary integral equation (4.2) is evaluated on the n boundary nodes. For 
3D problems one obtains a set of 3 n algebraic equations, which can be writ- 
ten in matrix form. Rearrangement according to the known boundary values 
y and the unknown boundary values x 

Au = Bt De^ -> Ax = By + De^ (4.6) 

provides a direct relation between the unknown plastic strains on the 
cell nodes and the unknown boundary values x. In comparison to the linear 
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elastic case there is additionally the inelastic part with the vector and the 
corresponding matrix D. This matrix consists of the regular and the weakly 
singular integral parts over the cells. 

Since within the flow law (2.7) the plastic strains are locally linked with 
the stresses, the stress tensor must be provided on every cell node. 

For cell nodes inside the domain, the formulation for displacement deriva- 
tives (4.3) in its discretized form can be applied. Via the kinematic relation 
(2.1) and the stress-strain relation (2.4) the stress tensor can be derived. 

There are three different approaches to determine the stress tensor for 
cell nodes on the boundary. The first is to use the shape function deriva- 
tives. This method is realized in the existing boundary element program for 
three-dimensional elastoplastic problems because it requires little time for 
computation and is well-known from elasticity. 

The other approaches are based on integral formulations of the displace- 
ment derivatives. Other than the local shape function derivatives, they can 
easily be extended to geometrically nonlinear problems and general stress- 
strain relations (see Foerster, Kuhn [11]). One approach is to evaluate equa- 
tion (4.3) for source points inside the domain located close to the boundary 
to avoid the evaluation of hypersingular integrals. This is more or less a 
heuristic method but it works if there are adaptive integration algorithms 
to determine the arising nearly singular integrals. The other approach is to 
evaluate the hypersingular identity for the displacement derivatives (4.6) in 
its discretized form. Since this equation is available for smooth boundaries 
only and additionally, (7^ ’"-continuity of the displacements is required, at the 
moment domain discretization and approximation of plastic strains must be 
selected in such a way that the nodal points of the shape functions of 
are not located on edges, corners or boundary element edges. This can be 
done eflficiently by decoupling of boundary and domain meshes together with 
introduction of discontinuous cells to avoid edges and corners (see Herding, 
Kuhn [15]). 

When evaluating the stress tensor at all cell nodes using the above meth- 
ods and rearranging the matrix equation with respect to the boundary value 
problem 

cr — Hu-\-Gt + De^ -> a =Ax-\-By + De^, (4.7) 

one receives a direct relation between the stress tensor, the boundary values 
and the plastic strains. Elimination of the unknown boundary values x using 
equation (4.6) supplies the stress tensor on all cell nodes 

cr = {AA-^B + B)y + {AA~^D + D) + K (4.8) 

in dependence on the external load y on the boundary nodes and the plastic 
strains on the cell nodes, cr^ can be identified as that part of the stress 
tensor a which would occur under the current load y at linear elastic material 
behaviour. K is the plastic corrector term. 
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For consideration of loading history, Equation (4.8) must be rewritten as 
an incremental formulation, and external load has to be applied in steps. This 
in combination with the flow law and the corresponding evolution equations 
results in a nonlinear set of equations which must be solved iteratively for 
every single load step. Due to this, after each load step the accumulated 
plastic strains and the stress tensor on the cell nodes are known. From this, 
the unknown boundary values can be determined via relation (4.6). In a post- 
processing procedure, displacements and stresses on arbitrary source points 
inside the domain and on the boundary can be determined via the boundary 
integral equations provided in Section 4.1. 

A detailed description of the elastoplastic algorithm and numerical exam- 
ples can be found in the PhD-thesis of Dallner [9]. 



5 Crack Problems and Boundary Element Method 

5.1 Crack Modelling in the Boundary Element Method 

In order to derive stresses and strains in a cracked body by using Boundary 
Element Method, the crack surfaces must be deflnitely decoupled. Basically, 
there are three approaches: Application of Green’s fundamental solutions [40, 
31, 22, 38], subregion technique and the DUAL BEM. 

If Green’s fundamental solutions are applied, neither in linear elastic nor 
in elastoplastic cases discretization of the crack surfaces with boundary el- 
ements is necessary; application is, however, possible only for 2D problems. 
Recently, Telles et al. [41] introduced an extension to 3D problems in which 
Green’s solutions for linear elastic material are determined numerically. The 
advantage that crack discretization is not necessary, however, cannot be main- 
tained. For this reason, the subregion technique and the DUAL-BEM were 
realized in the framework of this project. 

5.1.1 Subregion technique 

The subregion technique, introduced in the Boundary Element Method by 
Lachat and Watson [27], was first used by Blandford et al. [5] for decoupling 
of crack surfaces. When the cracked body is split into two subregions at the 
ligament (see Fig. 5.1), the crack surfaces are deflnitely decoupled in separate 
domains. Using subregion technique BEM can be applied to both linear elastic 
and elastoplastic crack problems [9, 17]. 

Another advantage of the subregion technique is that a block diagonal 
structure of the system matrix is generated. This leads to a considerable 
reduction of computing time if suitable block solvers are used [3, 4]. Since 
different material behaviour can be assigned for each subdomain, interface 
cracks can be modelled, too. In nonlinear cases, a multiple node concept is 
necessary for the plastic strains [9]. 
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Fig. 5.1. Decoupling of crack surfaces by subregion technique 

A drawback is that the subregion interface intersects the cracked body in 
that zone where the highest stress gradients occur. There are two approaches 
to approximate the singular crack near-field well. Either the discretization is 
refined towards the crack, which is very costly in terms of time, or, following 
FEM, special elements are used to describe the singular crack near-field. 

In linear elastic cases, this means for boundary elements along the crack 
front that on the crack surfaces the shape functions of the displacements have 
to reproduce yF-behaviour, and that the shape functions of the tractions on 
the ligament have to reproduce l/^r-behaviour. Cruse and Wilson [8] were 
the first ones to use the quarter-point elements known from FEM with the 
respective extension for approximation of tractions in 3D Boundary Element 
Method. Later Luchi and Poggliani [30] introduced crack front elements with 
special shape functions and a suitable regularization for the approximation 
of the tractions in 3D BEM {traction singular mid-point elements). 

For elastoplastic problems, singularity in the HRR field depends on the 
hardening exponent (see Section 3.2). For this reason, the traction singular 
mid-point elements were used and extended to general singularity orders. Ad- 
ditionally, singular volume elements were derived for approximation of plastic 
strains around the crack front. Detailed description of the shape functions of 
these singular boundary and volume elements as well as the respective nu- 
merical integration can be found in the reports [23, 24] and the PhD-thesis 
of Huber [17]. 

The most apparent drawback of crack modelling using subregion tech- 
nique is that it requires more time and effort for discretization. This is espe- 
cially costly when multiple cracks are modelled and when remeshing becomes 
necessary in case of crack propagation. 

5.1.2 DUAL BEM 

The idea to use different boundary integral equations for definite decoupling 
of the crack surfaces was introduced by Watson [42] for elasticity and by 
Gray and Giles [12] for potential problems. Hong and Chen [16] first applied 
the traction equation as the additional boundary integral equation for crack 
modelling. This technique was taken up by Portela, Aliabadi and Rooke [36], 
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who created the name Dual Boundary Element Method {DUAL BEM) and 
used this method for 2D crack growth simulation. There are also applica- 
tions [32], [13] for 3D elastic crack problems and in a recent paper an exten- 
sion to elastoplastic material behaviour in 2D [29]. 

Boundary Integral Equations on Crack surfaces 

Both the boundary integral equation for displacements and for tractions for 
linear elastic material behaviour can be derived on the crack surfaces ana- 
logous to that for source points located on the non- cracfced boundary (see Fig. 
5.2). If a formulation where the source point is excluded from the domain is 
assumed and if the boundary on the source point is smooth, the limiting 
procedure e — >> 0 can be evaluated separately for each crack surface. With 
respect to a well-defined description, all parameters of one crack surface are 
marked by the upper index n, those of the opposite surface are marked by t. 




Fig. 5.2. Source point located on non-cracked boundary or crack surface 

Thus the strongly singular displacement boundary integral equation for 
a source point located on a crack surface can be written as 

+ + J Tij{^,x)uj{x)dr{x) = J Uij(^,x)tj{x)dr{x) 

r-r^-r* r 

(5.1) 

with r = r -f -h r^. The two integral-free terms for the displacements on 
the one and on the opposite crack surface with the corresponding strongly 
singular integrals are characteristic. From equation (5.1) one recognizes im- 
mediately that evaluation on both crack surfaces leads to an identical result. 
For this reason, this equation for itself is not sufficient for crack modelling. 

In this case also the traction boundary integral equation is necessary 
which can be obtained from the formulation for the displacement derivatives 
on the crack surfaces. The limiting procedure (see Fig. 5.2) and the regu- 
larization of the strongly singular and hypersingular integrals (see Section 
4.1) can be done separately for each crack surface. Multiplication with the 
elasticity tensor CaUi ^nd the normal vector n&(^^) at the source point on 
the crack surface provides the traction boundary integral equation for F^ 
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Uaj = -^{^,x)Camnt{e) and f^j = (5.3) 



Again, the two integral-free terms are characteristic. Since the normals 
on the two crack surfaces are directed opposite to each other, one obtains the 
traction vector with negative sign. The two divergent parts 6aj(C) 
baji^^) must be treated together with their corresponding hyper singular in- 
tegrals and cancel out [14, 19]. For validity of this formulation, -continuity 
must be assumed for the displacements. 



Numerical solution 

In DUAL BEM, for linear elastic problems the complete boundary of the 
cracked body including the two crack surfaces is discretized within one sub- 
region and the boundary values are approximated via shape functions. In 
the framework of a collocation procedure, the evaluation of the displacement 
boundary integral equation (5.1) at all nodes on evaluation of the traction 
boundary integral equation (5.2) on F^ and evaluation of the standard dis- 
placement boundary integral equation on the remaining non-cracked bound- 
ary F leads to a system of algebraic equations Au = Bt (see Section 4). 
After rearranging according to known and unknown boundary values, one 
obtains the linear system of equations Ax = b to determine the unknown 
boundary values. 

Due to symmetry of the kernel Tij on crack surfaces, it is not possible to 
compute the strongly singular integrals in equation (5.1) on the crack surfaces 
by using the rigid body condition [36]. For this reason, all strongly singular 
and hypersingular integrals on the crack surfaces from equations (5.1), (5.2) 
are evaluated directly by using the method of Guiggiani [14, 19]. 

In order to meet the continuity requirements for the hypersingular inte- 
grals and to solve problems also with kinked cracks, the crack surfaces are 
meshed using discontinuous elements (see Fig. 5.3 a). In case of edge cracks, 
edge- discontinuous elements for surfaces intersecting a crack surface together 
with node-discontinuous elements at the crack front are used (see Fig. 5.3 
b). Using this modelling strategy for the cracks the non cracked-boundary 
can be discretized with continuous elements to keep the number of degrees of 
freedoms as low as possible [25]. In the existing boundary element program 
quadratic quadrilateral elements are implemented. 
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Fig. 5.3. Discretization: a) crack surface - b) edge crack 

Although the number of degrees of freedom increases because of the dis- 
continuous elements, it is, however, the most practicable method to deal with 
hypersingular boundary integral equations within a collocation solution pro- 
cedure. 



Extension to elastoplastic material behaviour 

The main point in the extension of DUAL BEM to elastoplastic material 
behaviour is to provide the dual boundary integral equations on the crack 
surfaces. Analogous to elasticity (equation (5.1)), one obtains 

J Tij{^,x)uj{x)dr{x) = 

r-r^-r* 



J ^ij (^5 ^) I'j {^) dr[x) -h J Eijk (^, x) {x) df2{x) (^-4) 

r n 

for the displacements, because the additional domain integral is only weakly 
singular (see Section 4.1). 

Derivation of the integral identity of the displacement derivatives on the 
crack surfaces is more complicated, because due to their singularity not only 
the boundary integrals but also the domain integral must be considered (see 
equation 4.4). For the boundary integrals on the crack surfaces, the limiting 
procedure and the regularization is the same as for elastic cases (equation 
(5.2)). Following equation (4.6), the strongly singular domain integral of each 
crack surface (see Fig. 5.4) can be transformed into a weakly singular domain 
integral and into regular and strongly singular cell surface integrals. This 
leads to the hypersingular formulation of the displacement derivatives 




(5.5) 
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for source points on a crack surface in case of elastoplastic material. With 
the help of the nonlinear stress-strain-relation (2.4) and multiplication with 
the normal vector Equation (5.5) can be transformed into the traction 

boundary integral equation with respect to 



r 




Fig. 5.4. Source point located on a crack surface for an elastoplastic problem 



By decoupling of the crack surfaces with the dizo/ boundary integral equa- 
tions and use of discontinuous cells, 3D elastoplastic crack problems can be 
evaluated within one subregion (see Section 4.2). For 2D problems, this was 
introduced by Leitao et al. [29] for the first time. 
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5.2 Evaluation of Fracture Mechanics Parameters 

5.2.1 Stress Intensity Factors 

For direct evaluation of the stress intensity factors, the numerically deter- 
mined crack near-field of displacements or stresses respectively is compared 
with the corresponding asymptotic crack near-field (see (3.1) and (3.2)). In 
principle, there are two different approaches. 

In the extrapolation method^ Pseudo-K-values are computed from the 
numerically determined stresses and displacements on points along a straight 
line towards the crack front. From these values, the required K-value on the 
crack front is extrapolated. 

In the correlation method, special elements are required describing the 
V^-behaviour of the displacements and the l/^/r-behaviour of the stresses. 
From the nodal values of the boundary data on the crack front, the K- values 
can be identified directly. 

In the framework of this project, both methods were realized and tested on 
various numerical examples [17, 23, 24]. The subregion technique was used to 
decouple the crack surfaces. For evaluation of the crack near-field, the nodal 
values of the boundary elements on the crack surfaces and the ligament were 
used. This method requires very little time, but the drawbaqk is that the 
element nodes on the crack surfaces and the ligament used for extrapolation 
must be perpendicular to the crack front. This implies increased discretization 
especially if crack geometry is complex. 

In order to use the extrapolation method independently of the boundary 
element discretization and in view of DUAL BEM, where only the crack 
surfaces are meshed, the K-values were also evaluated via stresses determined 
on source points inside the domain. 

5.2.2 J-integral 

For 3D crack problems, the J-integral was evaluated both for elastic (3.4) 
and for elastoplastic material behaviour (3.6). Detailed documentation of 
the results with numerical examples can be found in the DFG reports [23, 
24] and the PhD-thesis of Huber [17]. Only the most important aspects are 
summarized here. 

For numerical evaluation of the J integral, the integration plane and the 
corresponding boundary contour are discretized with elements and (see 
Fig. 3.1), which are parameterized via shape functions. The field quantities 
to be integrated are also approximated using shape functions. If traction-free 
and plane cracks are considered, the contour integral on the crack surfaces 
does not contribute to the J-integral and therefore need not be discretized. 
For the surface elements around the crack front, singular shape functions are 
used, for all other elements standard quadratic shape functions. 

The required nodal values for stresses and displacement derivatives can be 
determined by evaluation of source points inside the domain for both elastic 
and elastoplastic material behaviour. For the nodal values on the ligament, 
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source points inside the domain located close to the boundary are used. For 
determination of the nodal values on the crack front for the singular shape 
functions, source points near the crack front axe evaluated. Via the singular 
shape functions, they are re-calculated to provide the values on the crack 
front. 

For elastic cases, the nodal values of the higher derivatives {ui^u ^ , (Tij^h) 
can also be determined via evaluation of source points inside the domain 
with the respective kernels. For plastic cases, this is not yet possible, because 
hypersingular domain integrals occur. For this reason, difference quotients 
are used here. The plastic component of the strain energy density can be 
integrated using the trapezoidal rule for each load step. 

With these approximations, each element and can be integrated 
numerically via Gauss-quadrature. Summing up the results, one obtains the 
J-integral at the point s on the crack front. For the linear elastic case, ad- 
ditionally the J-integral is decomposed into its modes, and the K- values are 
calculated. 

6 Conclusion 

In conclusion, there are three main scientific results obtained in the project 
over the period from 1989 to 1995. 

It could be proved [9, 10, 24] that the Boundary Element Method can 
be used successfully to solve 3D thermo-elastoplastic problems and that it is 
especially suitable for problems with local plasticity. 

By means of subregion technique for decoupling of crack surfaces, the 
Boundary Element Method could be extended to 3D elaistoplastic crack prob- 
lems [23, 24, 17]. Especially in LEFM, it is an efficient tool for exact determi- 
nation of fracture mechanics parameters such as stress intensity factors and 
J-integral. But also in the framework of deformation theory, the J-integral as 
a fracture parameter in the HRR field can be evaluated efficiently by using 
the Boundary Element Method. 

In order to develop the potential of the Boundary Element Method for 
numerical simulation of fatigue crack growth, a new technique for crack mod- 
elling was applied - the DUAL BEM. Apart from detailed incorporation of 
theoretical basics, a general formulation of the dual boundary integral equa- 
tions on crack surfaces could be developed for elcistoplastic material behaviour 
in 3D. In the framework of LEFM, DUAL BEM was realized in terms of a 
computer code for 3D problems. Its efficiency wats demonstrated by a bench- 
mark [25]. 
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Adaptive Domain Decomposition Methods for 
Finite and Boundary Element Equations 

G. Haase, B. Heise, M. Kuhn, and U. Langer 



1 Introduction 

The use of the FEM and BEM in different subdomains of a non-overlapping 
Domain Decomposition (DD) and their coupling over the coupling bound- 
aries (interfaces) brings about several advantages in many practical applica- 
tions. The paper presents parallel solvers for large-scaled coupled FE-BE-DD 
equations approximating linear and nonlinear plane magnetic field problems 
as well as plane linear elasticity problems. The parallel algorithms presented 
are of asymptotically optimal, or, at least, almost optimal complexity and of 
high parallel efficiency. 

The Domain Decomposition (DD) approach offers many opportunities 
to marry the advantages of the Finite Element Method (FEM) to those of 
the Boundary Element Method (BEM) in many practical applications. For 
instance, in the magnetic field computation for electric motors, we can use 
the BEM in the air subdomains including the exterior of the motor more 
successfully than the FEM which is prefered in ferromagnetic materials where 
non-linearities can occur in the partial differential equation (PDE), or in 
subdomains where the right-hand side does not vanish [23, 34]. The same is 
true for many problems in solid mechanics [31] and in other areas of research. 
A very straightforward and promising technique for the coupling of FEM and 
BEM was proposed by M. Costabel [7] and others [6, 33] In the different 
subdomains of a non-overlapping domain decomposition, we use either the 
standard finite element (FE) Galerkin method or a mixed-type boundary 
element (BE) Galerkin method which are weakly coupled over the coupling 
boundaries (interfaces) Pc. The mixed BE Galerkin method makes use of 
the full Cauchy data representation on the BE subdomain boundaries via 
the Calderon projector. 

The main aim of the project was the design, analysis and implementa- 
tion of fast and well adapted parallel solvers for large-scale coupled FE/BE- 
equations approximating plane, linear and nonlinear magnetic field problems 
including technical magnetic field problems (e.g. electric motors). To be spe- 
cific, we consider a characteristical cross-section, which lies in the (x, y)-plane 
of the R^, of the original electromagnetic device that is to be modelled. Let 
us assume that i?o C R^ is a bounded simply connected domain and that ho- 
mogeneous Dirichlet boundary conditions are given on Pd = dfio. Formally 
the nonlinear magnetic field problem can be written as follows [25] 




xen ( 1 . 1 ) 

( 1 . 2 ) 

(1.3) 
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-div (u{x, |Vu(x)|)Vtt(x)) = S(a;) + — 

OX dj 

u{x) = 0 , X £ Fd 

\u{x)\ 0 for \x\ ->► (X), 

with i? := \ Hq. The solution u is the z— component of the vector 

potential A = {Ax.Ay.AzY' introduced in the Maxwell equation. The com- 
ponent of the current density, which acts orthogonal to the cross-section being 
considered, is represented by S(x), whereas Hqx and Hoy stand for sources 
associated with permanent magnets that may occur and u{.) denotes a co- 
efficient depending on the material and on the gradient |Vu(x)| (induction). 
Now, we introduce the exterior domain by defining a, so called, coupling 
boundary T_|_ := The definition of F^ is restricted by the conditions 

u{x) = Up \/x e i7+, (supp SUsupp Ho) C i?_, diam(i7o U i?_) < 1, (1.4) 

where := \ (i?+ U i?o)- Note that the condition diam(i?o U i?_) < 1 

is only technical and can be fulfilled by scaling the problem appropriately. 
Besides the decomposition /} = U i7+, we allow the inner domain i?_ to 
be decomposed further following the natural decomposition of i?_ according 
to the change of data: 



Nm _ 

i7_ = (J with Hi nfij = ^ Vi ^ j. (1.5) 

In Section 2 (see also Appendix A), we present an automatic and adaptive 
domain decomposition procedure providing such a decomposition of f2 into 
p subdomains (p = number of processors to be used) and such controlling 
data for the distributed mesh generator [11] that we can expect a well load- 
balanced performance of our solver. 

In Section 3, we consider linear plane magnetic field problems for which a 
domain decomposition according to Section 2 is available. Now, we can make 
use of the advantages of a mixed variational DD FE/BE discretization and 
propose an algorithm for solving the linear coupled FE/BE equations. First 
of all, the coupled FE/BE equations can be reformulated as a linear system 
with a symmetric, but indefinite system matrix. We provide a preconditioning 
and a parallelization of Bramble/Pasciak’s Conjugate Gradient (CG) method 
[2] applied to the symmetric and indefinite system (3.12). The components 
of the preconditioner can be chosen such that the resulting algorithm is, at 
least, almost asymptotically optimal with respect to the operation count and 
quite robust with respect to complicated geometries, jumping coefficients and 
mesh grading near singularities (see numerical results given in Sect. 3 and in 
[30]). Using a special DD data distribution, we parallelize the preconditioning 
equation and the remaining algorithm in such a way that the same amount of 
communication is needed as in the earlier introduced and well studied parallel 
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PCG for solving symmetric and positive definite FE equations [17, 18] (see 
Appendix B). 

Section 4 is devoted to the description of the Full-DD-Newton-Solver 
for nonlinear magnetic field problems. In every nested Newton step we use 
basically the linear DD-solver given in Section 3. 

In Section 5, we apply our linear DD-solver to plane linear elasticity 
problems modelled by Lame’s system of PDEs. An appropriate adaptation 
of the components of the DD-preconditioner results in a parallel solver the 
efficiency of which is comparable to that of the solver for the linear magnetic 
field problems described in Section 3. 

In Section 6, we briefly describe the software package FEMQDBEM [14] 
and draw some conclusions. All numerical results presented in this paper 
were obtained by the use of the package FEMQDBEM. The code runs on 
various parallel computers and programming platforms including PVM (see, 

e.g., [ 28 ]). 



2 Adaptive Domain Decomposition Preprocessing 



2.1 The DD-Data Partitioning 



In this section, we focus our interest on how a decomposition of the domain 
fl into a given number of subdomains can be obtained from the natural 
decomposition into domains according to the change of materials (1.5). We 
are interested in well load-balanced decompositions especially in the case of 
discretizations which are adapted to singularities. 

We assume that a triangular-baised description of the geometry of the 
problem under consideration is given. Besides the geometrical data each tri- 
angle is characterized by a parameter pointing to that of the ATm material- 
regions the triangle belongs to. Note that interfaces between different materi- 
als, i.e. the boundaries of the f7j’s (cf. (1.5)), are represented by edges of the 
triangulation. We are interested in a decomposition into p > Nu subdomains 



j7 = where i7p := i?+ and tlj = [^ Hi Vj = 1, . . . , Nm 

( 2 . 1 ) 

where the sets of indices are given by X := {1, . . . ,p} and 

Nm 

Ij Cl* :={!,. IjnIk=<D yj^k, 



i.e., the subdomains Oj determined by the materials may be decomposed 
further (see, e.g., [19]). We assume that there exist open balls and 
{i E X^) with positive radii Zi smd r^, such that B^v C Qi C Bj. and 0 < 
Vi E X^ with fixed (i-independent) constants c and c. Note, in 
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the case of Q being bounded we had I* := {1, . . . ,p} and in the following all 
terms induced hyp, which then stands for the exterior domain, would vanish. 

Although the algorithm being used for decomposing SI is based upon a 
given triangulation of the domain it is of advantage to use a special DD-data 
structure as input for DD-based algorithms running on massively parallel 
computers. Thus, starting off with a given triangular-based geometrical de- 
scription (*.tn-file) we wish to end up with a well-balanced decomposition 
of our problem which is described by using some DD-data format (*.dd-file). 
Figure 2.1 shows the interactions between the preprocessing codes Decomp, 
TVi2DD and AdapMesh and the file-types *.tri, *.dd, and *.fh being involved. 

In the simplest case the process starts, on the left of the diagram, with 
applying Decomp to a *.tn-file which results in a decomposition as defined 
in (2.1), i.e. each triangle is assigned to one of the S2i’s, i £ I. The output 
of this process is also a *.fn'-file which then is converted into a *.dd-file by 
applying the program Tri2DD. In our case, such a *.dd-file is the input 
for the parallel code FEMQDBEM [14]. Because of the simple structure of 
the DD-data format being used it is quite easy to implement additional re- 
finement information concerning, e.g., singularities into the *.dd-file. In the 
latter case the mesh created from this file may differ significantly from the 
one described by the original *.tn-file. As a consequence we have to expect 
a bad load balance. This problem can be solved by applying the program 
AdapMesh which simulates the mesh generation as it occurs in the paral- 
lel program using, optionally, adaptivity information which may be obtained 
from a coarse-grid computation. Thus, AdapMesh creates a *.tn-file which 
is the input for restarting the cycle with Decomp. Note, in the optimal case 
with respect to the load balance the mesh used for computations (i.e. the one 
created from a *.dd-file within FEMC®BEM ) would coincide with the mesh 
being used for the decomposition. 




Fig. 2.1. The components of the preprocessing. 
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2.2 A Short Description of the Preprocessing Codes 

At this place, we are going to explain the codes and the main ideas they 
are based on. More information and technical details can be found in the 
documentation [12]. First we give a short description of the codes. 

Decomp decomposes single-material domains using the spectral bisection 
method (sbm) [42]. That is, as long as there are less than p subdomains 
the largest subdomain according to the number of triangles is divided into 
two new subdomains by the sbm. As a result, each triangle is assigned 
to one of the subdomains. 

Tri2DD converts triangular-based data into the DD-data format. The algo- 
rithm is based on the definition of edges which then define the faces. 
Note, interfaces between different materials will be maintained as they 
were given in the original tri-file. On the other hand the artificially 
created boundaries within one material are smoothed. 

AdapMesh creates a mesh based on DD-data using, optionally, adaptivity 
information. The resulting *.tn-file can be used as input for Decomp. 

During the preprocessing we are concerned with two types of describing 
data. The first one (*.tn-files) is based on nodes (characterized by their coor- 
dinates) which define the edges (straight lines or arcs of circles are allowed) 
and, finally, triangles defined by their edges. Each of the triangles is charac- 
terized by two additional parameters, where only one of them pointing to the 
material the triangle belongs to has to be initialized from the very beginning. 
The second one describes the mapping of the triangles on the subdomains 
and it is defined as a result of decomposing i7. 

The second type of data (*.dd-files) follows the DD-data format described 
in [14]. It is based on defining cross-points numbered globally which then 
define edges. Main objects are faces described by edges. The faces, or a union 
of them, are mapped onto the array of processors. The *. /5-files are auxiliary 
files (optional) and contain controlling data or fixed cross-points as it occurs 
in our example (see below). 

2.3 Example: Direct Current Motor 

Now we apply our preprocessing tools to a model of a direct current motor 
(dc-motor) with permanent excitation (see also Fig. 2 and 3 in [24] for a 
detailed description of the machine). We start with a *.tn-file with 18 different 
material regions as shown on the left in Figure A.l (Appendix A) . After 
applying Decomp, Tri2DD and one full cycle we have obtained the *.dd-file 
which represents the decomposition as shown on the right in Figure A.l. Note 
the additional cross-points especially on the outer circle which have been 
pre-defined in a *. /6-file. In our case no additional refinement information 
have been used so that the second decomposition (Figure A.l) did not differ 
significantly from the first one which were already obtained after applying 
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Decomp and Tri2DD. Figure A. 2 (Appendix A) shows, on the left, the 
mesh being finally used as initial mesh for computations and, on the right, 
the equipotential lines of the solution. 



3 Parallel Solution of Linear Coupled 

BE/FE— Equations Approximating Linear Magnetic 
Field Problems 

3.1 A Mixed Variational Formulation 

Let us first consider a linear (i/ = u{x)) magnetic field problem of the form 
(1.1) - (1.3) for which a domain decomposition according to Section 2 is 
available. In particular, we assume that the index set X = Xp U Xb can be 
decomposed into two disjoint sets of indices Xp and Xp such that 

PGXb, (3.1) 

(supp S(.) U supp Ho(.)) n = 0 \/ieXB, (3.2) 

i/(x) = Ui = const Vi E Xb- (3.3) 

For each {i E X) the index i belongs to one of the two index sets Xb and 
Xp according to the discretization method applied to i?i, where Xb and Xp 
stand for BEM and FEM, respectively. 

Following M.Costabel [7], G.C. Hsiao and W.L.Wendland [33] and others, 
we can rewrite the weak formulation of the boundary value problem (1.1) - 
(1.3) by means of partial integration in the boundary element subdomains 
i ^ Xb and by the use of Calderon’s representation of the full Cauchy data 
as a mixed DD coupled domain and boundary integral variational problem: 
Find (A, u) E V := Ax Uq such that 

a(A, u; T), v) = {F, v) V(t?, v ) £ V (3.4) 

where 

a{\,u\T),v) := aB{\u-,r],v) + ap{u,v) 
aBi\u;r],v) ~ ^ J^il{'DiUi,Vi)ri + ^{Xi,Vi)ri + {\i,K^iVi)ri + 

ViAi)rj - {rii,ICiUi)ri - 
(^P)VpAp)/’^ {Vpj ^p'^p) 4" 2^^P’^p)^+^ 
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aF{u,v) := / u{x)V^u{x)Vv{x) dx 

i&ip 

{F,v) := E / - Hoy(x)^ + Hox(x)^ 

L cbc ay J 

,Vi)ri ■- \vi ds and Vi = v\gni, Ui = u\gni, Fi := dfii, 



dx 



(A, 



with the well-known boundary integral operators Vi^lCi^Vi defined by the 
relation 

VjAi(x) := J S{x,y)Xi{y) dsy 

Fi 

K-iVi{x) := J dyS{x,y)vi{y) dsy 

Fi 

T>iUi{x) := -dx f dyS(x,y)ui{y) dsy 

Fi 

and with the fundamental solution 



(3.5) 



S{x,y) = -—\og\x-y\ 



(3.6) 



of the Laplacian. The mapping properties of the boundary integral operators 
(3.5) on Sobolev spaces are now well known [8]. The spaces Uq and A are 
defined by the relations 



Uo := {u e HH/2-) : u\r,, G Hi/2(rBs),«|a„„ = 0} 

A := {A = : A^ G H-i/2(r,), i G Is) = UieXs ‘ 

with Ai = H“^/^(r,), i G Ib- Further we use the notation Fbe •= 
U<6Zb \ ^d,Ffe ■= Uieip \ ^d,Fc ■= Fbe U Ffe and Hf := 
UiGXF Introducing in V := A x Uq the norm 



||(A,?i)||v (^\\M\a + W'^\rBE\\n^/2(^rBE) H^llHi(r 2 F)) ' 

with 

then one can prove that the bilinear form a(.,.) is V-elliptic and V-bounded 
provided that the domain decomposition satisfies the conditions imposed on 
above (see also [33]). Therefore, the existence and uniqueness of the solution 
are a direct consequence of the Lax-Milgram theorem. 
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3.2 The Coupled BE/FE Discretization 

Now, we can define the nodal FE/BE basis of piecewise linear trial functions 
based upon a regular triangulation of the subdomains Qi^i € Zp and the 
according discretization of the boundary pieces Fij — fl i?j, € X: 

^ = [(/»1 , . . . , J • • • » 4>Na+Nc » 4>Na+Nc^\ j j <t>N], 

where N = iV^ + iVc + iV/ and Nj - = EieiB ^A,i- 

Here, , . . . , </>iVA are the basis functions for approximating A on fi, i G Xb, 
0iVA+i 5 ■ • • j 4>Na-\-Nc represent u on Fc and (I>Na+Nc+i j j <I>Na-\-Nc+Ni ap- 
proximate u in i £ Xp. The definition of the finite dimensional subspaces 
of A, Uo and V 

Ah := span [0i,</>2, . . . 

IJh := span 
Vh := Ah X Uh 

allows us to formulate the discrete problem as follows: Find Uh ^ Vh such 
that 

a{uh, Vh) = (F, Vh) yvh e Vh- (3.9) 

The isomorphism ^ V/i leads to the linear system: 

/ Ka -Kac ^ \ f ^a\ f (a \ 

I Kca Kc Kci I I uc = I fc , (3.10) 

V 0 Kic Kj J \ui J \fij 

where the block entries are defined by 

(Kaua,va) = Y1 with A< = ^a^^a^V* = ^a,va„ 

i€^B 

{Kcaua,vc)= ^ + -2^\,Vi)ri} 

+i'p{{Xp,ICpVp)r^ - ■^{Xp,Vp)rA 

Kac = KZa 

Kc = Kcb + Fcfj with 

(/^CBUc.vc) = M'^iUi,Vi)ri, Ui - ^CiUCi, Vi = ^a^Ci and 

t€lB 

where u\q^ = ^p\ip^ v\qjp — ^pwp. Here, (i £ Xb) and {i ^ X) 

contain the basis functions for approximating A and u on d Hi ^ respectively. 
The basis functions in are used to approximate u in Hi {i £ Xp). The FE 
entries, especially Fj, are sparse matrices, whereas the BE blocks are fully 
populated. 
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3.3 The Parallel Solver 

3.3.1 Bramble-Pasciak’s Transformation and Spectral Equivalence 
Results 

The nonsymmetric, positive definite system (3.10) can be solved approxi- 
mately by Bramble/Pasdak’s CG method [2]. The method requires a pre- 
conditioner Ca which can be inverted easily and which fulfills the spectral 
equivalence inequalities 

j^Ca < Ka< with 7^ > 1. (3.11) 

With the definitions 



K^=Ka, fi={A, Kn = K^i = i-KAc 0) 
_( Kc Kci\ f _ [ -tc 
\ Kic Ki “ V -f/ 






we can reformulate (3.10) as a symmetric but indefinite system: 

(^. )(::)=( I:)- 

Following Bramble and Pasciak [2] this system can be transformed into 

Gu = p, where (3.13) 



G,^l 



Ca"K^2 

K2iC^\K^ - Ca) K2 + K2iCJ'Ki2 



- f2 



Then, the matrix G is self-adjoint and positive definite with respect to 
the scalar product [., .] which is defined by 



[w,v] := {{Ki - C^jwijVi) -I- (w2,V2). 
Moreover, G is spectrally equivalent to the regularisator R, where 

0 



(3.14) 



R 



-io 



K2 + K2iKr^Ki2 

Bramble and Pasciak [2] proved the spectral equivalence inequalities 



where 



a 



A- l4-“-l-\/a-h 



-1 






Vv G R^, 


(3.15) 


Y It y/u 

1 - a 


(3.16) 



with a = 1 - (1/ja)’ Thus, we have to find a preconditioner C 2 for the matrix 
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K2 + K 21 K, 



~^K\2 = ^caK^^Kac KcA 
^ V Kic Kj ) ' 



The DD preconditioner defined by 



C 2 






Ic KciBj 
0 



-T 



Cc 0 

0 Cl 



Ic 0 

Bj^Kic I I 



(3.17) 



(3.18) 



is spectrally equivalent to K 2 + K 2 \K-^ ^K \2 if we have preconditioners Cj 
and Cc fulfilling the inequalities 

Ic^c <Sc + KcaKJ^Kac < Tc^'C) (3.19) 

XjCi<Ki<^jCi, (3.20) 

where §c = Kc - KciKj^Kw + Kci{Kj^ - Bj'^)Ki{Kj^ - Bj^)Kic, 
and Bi is an appropriately chosen non-singular matrix [ 37 ]. 



Lemma 3.1. If the symmetric and positive definite block preconditioners 
Cl = diag{Ci^i)i^XF satisfy the spectral equivalence inequalities 

(3.19) and (3.20) with positive constants 7 ^, 7 j, then the spectral 

equivalence inequalities 

7.2^2 < K‘2 Kv2 < (3.21) 

hold for the preconditioner C 2 defined in (3.18) with the constants 

I 2 ^ ( 1 - \frh) ’ "^2 = max{7c,7/} ( 1 + ■ 

(3.22) 

Here fi = p{S^^Tc) denotes the spectral radius of Sq^Tc , with the FE 
Schur complement Sc and the operator Tc being defined by 



Sc = k c — k cil^i k IC andTc =kci{Kj -Bj^)Kj{Kj -Bj)kic^ 



o o o 

respectively. Kc^ Kci and K ic denote the non-zero FE blocks of Kc^ Kci 
and Kjc, respectively. 



The proof is given in [37], it applies the classical FE DD spectral equiva- 
lence result proved in [17, 18]. With (3.16), we conclude the following theorem. 

Theorem 3.1. If the conditions imposed on Ca, Cc, Ci, and Bj, especially 
(3.11), (3.19) and (3.20) are satisfied, then the FE/BE DD preconditioner 



C = diag{h,C2) 



(3.23) 



is self-adjoint and positive definite with respect to the inner product [., .] and 
satisfies the spectral equivalence inequalities 

7[(7 V, v] < [G V, v] < 7 [G V, v] Vv £ R^, 



(3.24) 
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with the constants 

7 = AminjljX^I and 7 = A max { 1 , 72 } , 
where X, X, 7 ^, 72 are given in (3.16) and (3.22), respectively. 

3.3.2 The Parallel PCG Algorithm 

For the vectors belonging to the inner coupling boundary Fc we define two 
types of distribution called overlapping (type 1 ) and adding (type 2 ): 

type 1 : uc, wc, sc are stored in Pi as uc,i = Ac^uc 

(analogous wc,i, sc,i) 

type 2: r(7,vc,fc are stored in Pi as rc,i, such that 

rc = Ei=i ^c,i^c,i (analogous vc,fc), 

where the matrices Ac^ are the “C-block” of the Boolean subdomain connec- 
tivity matrix Ai which maps some overall vector of nodal parameters into the 
superelement vector of parameters associated with the subdomain Hi only. 
Pi denotes the i^^ processor. 

Using this notation and the operators introduced in the previous section 
we can formulate an improved version of the PCG-algorithm presented in [37] 
with a given accuracy e as stopping criterion. This parallel PCG-algorithm 
is given in Appendix B. 

Note the vectors Zj = (z^,^,zc,i,Zyic,t)^ and h^ = (h^^j, hc,i, h^c,*)^ 
which have been inserted additionally in order to achieve a synchronization 
between the FEM and BEM processors especially in step 1 (matrix-times- 
vector operation). Without this synchronization one has to expect a com- 
putation time per iteration which is, depending on the problem, up to 30 
per cent higher. The definition of the vector avoids the computation of 
(occurred originally in step 4) which is not necessarily available {CA,i 
is defined such that the inverse operation C)(\wA,i can be performed easily). 

3.3.3 On the Components of the Preconditioner 

The performance of our algorithm depends heavily on the right choice of the 
components Ca, Cc, Cj and Bj defining the preconditioner C (see Theo- 
rem 1 ). Ca, Ci and Bj are block-diagonal matrices with the blocks Caa^ Cj^i 
and Bj^i, respectively. In our experiments, the following components have 
turned out to be the most efficient ones: 

Ciy. (Vmn) Multigrid V-cycle with m pre- and n post-smoothing steps 
in the Multiplicative Schwarz Method [18, 15]. 

Caa- {Circ) Scaled single layer potential BE matrix for a uniformly dis- 
cretized circle. This matrix is circulant and easily invertible [39]. 
(Hyp) Cjj = Mf(lKc,iM((}Ti as proposed by Steinbach [43]. 
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(HExt) Implicitly defined by hierarchical extension 
(formally Ejc,i = -BY}Kjc,i) [ 20 ]. 

Cc ‘> (S-BPX) Bramble/Pasciak/Xu type preconditioner [44]. 

(BPS-D) Bramble/Pasciak/Schatz type preconditioner [ 3 , 9]. 
(mgD) Kc*{Ic - {Kc*uc,vc) = de- 

scribed in [5]. 

These preconditioners C/, Cc, Ca^ and the basis transformation Bj sat- 
isfy the conditions stated in Theorem 3.1. In particular, inequalities (3.20) 
for Cl are fulfilled with constants 7 ^, jj independent of the discretization 
parameter h [18, 15, 20]. The preconditioner Ca is scaled such that 7 ^ > 1, 
and 7 ^ in (3.11) remains independent of h for both, (Circ) and (Hyp). In the 
case (Hyp), involves a basis transformation T*, a modified mass-matrix 

Mh,i and the hypersingular operator Kc,i, and property (3.11) for Ca is due 
to properties of the corresponding continuous operators and T>i [ 43 ]. 

With respect to Bj, the constant fi in (3.22) can be estimated by 

fjL < -h Cl l)‘^ < 4- C2(lnh~^))‘^ , 

cf. [ 20 ], with k being the number of local multigrid iterations, I being the num- 
ber of grids, the /i-independent multigrid rate f] < 1, and the /i-independent 
constants Cj. Thus, fx is independent of h if A: = 0{lnlnh~^). 

In the (S-BPX) case, the inequalities in (3.19) hold with an /i-independent 
constant 7 ^, and 7 ^^ < cs{l+fx) [20, 44]. Therefore, we can prove for (S-BPX) 
that 7/7 < 04(1 -f y/l H- /i)^ = 0{1) if A: = 0{\nlnh~^). However, 

in the range of practical applications, this means A: = 1 ! For (BPS-D), the 
estimate Jc/Tc — “b f^) bas been proved [3]. In the case 

(mgD), Cc arises from the hypersingular operator and C^^ is realized via 
a standard multi-grid procedure applied to the global operator (assembled 
over the subdomains) Kc* which is the discretization of a pseudo-differential 
operator of order one. Kc* becomes positive definite after implementing the 
Dirichlet boundary conditions. We can get an estimate of the same type as 
for (S-BPX). Note that the FE/BE>-Schur-complement energy is equivalent 
to the ||.||2 (see, e.g., [3, 5, 9]). 

Consequently, we can estimate the numerical effort Q to obtain a relative 
accuracy e by Q = 0{h~‘^ \nh~^ Inlnft"^ lne:“^) for the (BPS-D) case, and 
by <9 = 0{h~'^ ln\nh~^ lne“^) in the (S-BPX) case, i.e. almost optimal. If 
a BPX-type extension [38] is applied instead of (HExt) in a nested iteration 
approach [ 21 ], we can prove that Q = 0(/i“^), i.e., we obtain an optimal 
method. 

Preconditioners Ca,i for Ka^ and Cc for the FE/BE Schur complement 
derived on the basis of boundary element techniques can also be found in [35, 
39]. The construction of efficient FE Schur complement preconditioners was 
one of the main topics in the research on FE-DD-methods (see Proceedings 
of the annual DD-conferences since 1987). 




Adaptive Domain Decomposition Methods 133 



3.4 Numerical Results 

The electromagnet as shown in Figure 3.1 serves now as test example for 
exterior magnetic field problems which lead to the variational form (3.4). 




Fig. 3.1. The magnet and the subdomains being used (left) and the equipotential 

lines of the solution (right) . 

The copper domains (I, II), where we have a current density of the 
strength S and -S, respectively, and the iron domain (III), are squares with 
the edges being 16cm long. The material dependent coefficients (air: IV- 
VIII) are given by vcu = 795779.0 AmV-ls-^ Vair = 795774.4 AmV-ls-^ 
VFe = 1000.0 AmV-^s-^ 

Table 3.1. Number of unknowns (iV), iteration count (/(e), e = 10“^), CPU time 
in seconds. The experiments were carried out on a Power-XPlorer using 8 or 
4 processors, respectively. 





FEM: 

BEM: 


I-VII 

VIII 


FEM: 

BEM: 


I-III 

IV-VIII 


FEM: 

BEM: 


I-III 

exterior 


1 


m 


CPU 


I(£) 


CPU 


1(e) 


CPU 


1 


15 


0.6 


17 


0.5 


14 




2 


19 


0.9 


18 


1.0 


19 




3 


20 


1.6 


19 


1.9 


20 




4 


21 


5.2 


21 


5.2 


21 




5 


23 


20.5 


22 


18.8 


22 


22.5 


N(5) 


67329 


18429 


16129 



We will compare two different coupling procedures. On the one hand we 
use the natural boundary (of the metallic material) as coupling boundary 
and on the other hand we introduce a circle with the radius 50cm as coupling 
boundary. The advantage of the second method is that we obtain circulant 
matrices which can be generated very fast whereas the first method requires 
the generation of fully populated matrices. A disadvantage of the second 
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method is that additional subdomains and, thus, additional unknowns have to 
be introduced. For this example, the uniqueness of the solution is guaranteed 
by the radiation condition, even if no Dirichlet boundary Fd is present. The 
radiation condition is implicitly contained in our BE discretization. 

Numerical results are given in Table 3.1. The operators Ca, Co, C/, Bj 
have been chosen as follows: 

CA,i : Circ {i = 8) and Hyp {i elB\ {8}) Cc ■ S-BPX 

Ci^i : Vll (i € If) : HExt {i G If). 

We will compare two different coupling procedures. Comparing the three 
choices of the subdomains and their discretization, we observe that in the 
first choice (column 1) much time is spent for handling the FEM subdomains 
IV, V with many interior nodes. We may conclude that the BEM (column 
2) is recommended for subdomains with a high ratio between the numbers of 
interior FEM nodes and the boundary nodes. The choice of the rectangular 
coupling boundary (column 3) increases the BEM system generation effort, 
but the total time remains nearly the same since the number of BEM un- 
knowns (and subdomains) is reduced. Finally we observe that the number of 
iterations is independent of the combination of BE/FE discretizations being 
used. 

4 Parallel Solution of Coupled BE/FE-Equations 
Approximating Nonlinear Magnetic Field Problems 

4.1 The Nested-DD-Newton-Solver 

Let us consider now a nonlinear magnetic field problem, i.e., we allow ferro- 
magnetic materials with non-constant permeability i/ to be in the FE subdo- 
mains. Then, the mixed DD coupled variational problem can be written as 
follows: Find (A,u) G V := yl x Uq such that 

aN{\,u]T],v) = {F,v) y{f],v)£Y, (4.1) 

where 



a{\u;r],v) := aB(A,u; ry,i;) -h aFN(u,v) 



aFN{u,v) 




i/(x, \Vu\)V'^u{x)Vv{x) dx 



and aj 3 being defined as in (3.4). We refer to [25, 30] for the analysis of 
nonlinear magnetic field problems. Consequently, the discretization results in 
a nonlinear system [26] 
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where the nonlinear operator K : — > R^ can be split up into the 

nonlinear operator Kp • — y R^ originating from the nonlinear 

form apN and the linear operator Kp : — y R^ originating from 

dB [26]. 

The nonlinear system (4.2) is solved by Newton’s method, see [26]. In 
this algorithm, the linear Newton defect system (4.5) with the Jacobi matrix 
K^[u] can be written as 




It can be rewritten in a block form similar to (3.10), 



[Ka -Kac 0\ /wA /dA 

I Kca Jc Jci 1 ( wc I = I I ’ (4-4) 

Y 0 Jjc Ji ) \W// \d// 

and can be solved by a PCG method as described in Subsection 3.3. Further, 
we apply the ’’nested iteration” method [21, 24], i.e. we generate a multilevel 
sequence of coupled FE/BE discretizations denoted by the grid numbers q = 
1,. We begin with solving the nonlinear system by Newton’s method 
on the coarsest grid q = 1. Then we take the approximate solution on the 
grid q — I interpolated onto the finer grid g as an initial approximation for 
Newton’s method on the grid q^ for q = 2, . . . J. This allows us to ’’catch” 
the nonlinearity on the coarsest grid, see [24, 26, 28, 30]. 

The result of the parallel algorithm presented in the following will be 
the approximate solution on the fine grid q = I with relative accuracy e 
(nested iteration e). 

Algorithm PNN (Parallel Nested Newton) 

Step 0 

Initialization of the grid number: 

(0.)(P) q ■.= 1. 

Step 1 

Set the initial solution for grid q: 

(1.1) (P) IF q=l THEN = 0; 

(1.2) (P) IF g > 1 THEN u° = 

the initial solution is the interpolation of the best solution on grid q — 1. 

(1.3) (P) Initialize the Newton iteration number j 0. 

Step 2 

Compute the initial Jacobi matrix and the defect vector 
(2.)(P) dj = f, - 
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Step 3 

(3.)(P) Choose a relaxation parameter with 0 < < 1 and a relative 

accuracy parameter eun with 0 < e:iin < 1. 

Step 4 

(4.)(S) Solve the linear defect system 

= df 1 (4.5) 

approximately (with relative accuracy e-nn) using a PCG solver as de- 
scribed in Subsection 3.3. The result is 

Step 5 

Correct the solution: 

(5.)(P) =uJ+t|wJ+i. 

Step 6 

Control the convergence (parameter Cr is chosen a priori with Cr < 1): 

(6.1) (P) Compute the new defect vector and the new Jacobi matrix 

df 2 ^ jj+1 ^ AT'fuf 1]; 

(6.2) (C) Compute defect norms 

:= mm jcrr^, | ; 

GOTO Step 5 

:= 5 

IF q< i%HEN {q:=q-\- 1; GOTO St 
IF q = l THEN EXIT; 

(6.5) (P) Perform a further Newton step: 
j •= j 4- 1; 

GOTO Step 3. 

In this description, (P) indicates that the step is performed completely in 
parallel, i.e., independently on the processors. The solver (S) includes parallel 
independent parts, communication between processors handling neighbouring 
subdomains, and global communication, cf. Subsection 3.3. Note, (C) indi- 
cates that global communication is necessary. Obviously, the only additional 
communication (compared with solving a linear problem) is the computation 
of global defect norms. 

4.2 Numerical Results 

A direct current motor designed for electronic devices (electronic motor) 
which is excited by permanent magnets serves as a first real-life test exam- 
ple. The interior of the machine is discretized by finite elements (cf. Figure 
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4.1). Calculations have been made for both the machine with homogeneous 
Dirichlet conditions on its boundary and the infinite domain with Sommer- 
feld’s radiation condition where the infinite exterior domain is discretized 
by BEM. In order to obtain efficiency results, we consider additionally two 
model problems representing a quarter and a half of the whole machine, i.e. 
we discretize only sectors of 90 and 180 degrees, respectively, and impose 
Dirichlet conditions on the boundary. 






























W5i^ 



Example 



subdomains 



No. of unknowns 



Newton it. 

CG iter. 1st grid 
CG iter. 2nd grid 
CG iter. 3rd grid 
CG iter. 4th grid 
CG iter. 5th grid 
Newton it. 

CG iter. 6th grid 



generation 
linear solver 



Total time 



Dirichlet 
b. c. 



16 FBM 



374 129 



Dirichlet 
b. c. 



32 FEM 



734 199 



Dirichlet 
b. c. 



64 FEM 



1 514 008 



23.2 

71.7 



94 

3.633 

0.908 



Time in seconds, scale-up (normalized) and scaled efficiency (relative) on a 
GC-Power Plus using 16, 32, 64 processors, respectively; 2 Newton iterations on 
the grids 2-5, relative accuracy e = 10“®. 



radiation 

condition 



63 FEM 
1 BEM 



1489 416 
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The components of the PNN algorithm are chosen in the standard way 
[23, 24, 28]. In particular, the parameter sun can be adapted to the quadratic 
convergence speed of the Newton method [24, 30]. Here, a slash (/) marks 
the change from enn = 10“^ to £:iin = 10”"* in the accuracy of the CG solver. 
The components of the PCG solver have been chosen as follows: 

Ca,i - Circ {i £Tb) Cc '• BPS-D 

Ci^i : Vll {i € If) Bj^i : HExt (i € Ip). 

The components involved and the numerical effort for nonlinear problems 
are discussed in [26]. Numerical results are given in Table 4.1. We present 
further results, in particular with respect to efficiency, in [30]. Computations 
with up to 128 processors are documented in [26, 27]. In [28], the application 
of a parallelized global multigrid method based on DD ideas in Step 4 of the 
algorithm PNN is discussed. Further, in [29], we demonstrate that the CG 
with a global BPX or a global multigrid preconditioner yields a robust solver 
for practical problems. 



Table 4.2. Performance for the dc motor. 



Example 


Dirichlet 
b. c. 


Dirichlet 
b. c. 


radiation 

condition 


radiation 

condition 


Choice for Cc 


BPS-D 


S-BPX 


BPS-D 


S-BPX 


subdomains 


32 FBM 


32 FEM 


31 FEM 
1 BEM 


31 FEM 
1 BEM 


No. of unknowns 


417328 


417328 


414 568 


414 568 


Newton it. 1st grid 


mmgim 


■■Mi 


7 


7 


CG iter. 1st grid 






19,14,16, 

16,12,19,28 


10,14,14 

14,14,13,17 


Newton it. 2nd grid 


■KM 


■MM 


■KM 


■MM 


CG iter. 2nd grid 










Newton it. 3rd grid 


2 




■MM 


■MM 


CG iter. 3rd grid 


17,27 


l^lli 




■EMI 


Newton it. 4th grid 




■MM 


2 


■MM 


CG iter. 4th grid 






18,34 




Newton it. 5th grid 




■■■■i 


■MM 


4 


CG iter. 5th grid 








20,32,27,33 


Time (generation) 


25.2 


25.2 


33.6 


33.6 




168.9 


137.8 


178.8 


198.5 


Total time | 


194.1 


163.0 


212.4 


232.1 



Time in seconds, GC-Power Plus, 32 processors; relative accuracy e = 10 ^ 



The second practical example, a technical direct current motor (dc motor, 
see Subsection 2.3), is to demonstrate the complete algorithm. Starting with 
a user mesh, we apply an automatic domain decomposition procedure (see 
Subsection 2.3 and Appendix A) and a parallel mesh generator, the basic 
ideas of which are presented in [11], to obtain the initial mesh (^ = 1), 
which is to be refined four times to get the final mesh for our computations 
(^q = I = S). We present the numerical results in Table 4.2 and the level lines 
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of the solution in Figure A. 2. Again, we have done calculations for both, 
the machine with homogeneous Dirichlet conditions on its boundary and the 
infinite domain with the radiation condition. Best results with respect to 
the total computing time have been achieved with sun = 0.01. All other 
components of the algorithm, except (7c, are chosen as for the electronic 
motor example. 



5 Generalization to Linear Elasticity Problems 

5.1 The Mixed Boundary and Domain Integral Variational 
Formulation 



We now want to extend the ide^is discussed above to problems of plane linear 
elasticity in which the displacement u{x) = {ui{x),U 2 {x))'^ satisfies formally 
the system of Lame equations 

—/ji{x)Au{x) — (A(x) + /x(x))grad divu(x) = f{x) in i? 
u{x) = 0 on Fd, aki{u{x))m = gk(x) on Fn, {k = 1,2) 

where i? is a bounded Lipschitz domain, (Jki{u) are the components of the 
stress tensor a{u) and n(x) = (ni(x),ri 2 (x))^ is the outward normal vector 
to Fd U F]sf = r := dn (Fd / 0) and A and )Lt, A,/i > 0, are the Lame 
coefficients of the elastic material. In (5.1), / = (/i,/ 2 )^ is the vector of 
volumic forces, g = ( 91 , 92 )^ is the vector of boundary tractions. 

As before, i? is being decomposed into non-overlapping subdomains i?i, 
i — 1, . . . ,p, cf. (2.1). We then have fXi and A^ as Lame coefficients for each 
of the i?i’s. Similar to (3.2) we assume that the volumic forces vanish for 
X e Oi with i els - For simplicity we assume additionally that F^FdOi = ^ 
for i E Tb- Then we can write the variational formulation as follows: Find 
{(7,u) € V := yl X Uq: 

a{a,u\T,v) = (F,v) V(r,t^)€V, (5.2) 

where 



a(cr, u] r, v) := aB(cr, w; r, v) -h ai?(w, v) 

aB{cr,u;T,v) ^ I’i \ {F>iUi,Vi)ri + ^{<Xi,Vi)ri + {(Ti,K.iVi)ri 
ieiB '• 

ap{u,v) := ^2 / \ div u{x) divv{x) + 2 Hi ^ ^kl{u) eki{v) j dx 

:= E / f{x)v{x)dx-\- / g{x)v{x) ds, 
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with the duality pairing (., .), the traces U{ = u\do. and the boundary trac- 
tions a = [cTi]ieXB belonging to dOi and the strain €ki{u) := {dukfdxi 4- 
duifdxk)f2. Note, for simplicity we have assumed that i? is bounded such 
that, in contrary to Section 3.1, special terms for the exterior domain do 
not occur in the definition of as Nevertheless, the ideas presented above 
concerning exterior problems can be applied analogously. 

Let the spaces Uo and A be defined as follows: 

Uo := {uG[H 1(/2)]2: u|r^^€Hi/2(rBE),«|r„=0} 

where Fbe ■= Uieis \ •“ Uieip \ := Fbe U Fee- 

Let Op ■= then we consider the following norm in V: 

\\{a,u)\W ■■= ilM + + II«IIhm«.))'^'- (5-3) 

The boundary integral operators V^, ICi and T>i are defined as in (3.5), where 
d{.) has to be replaced by the operator T(.) which is defined in its strong 
form as T(.) := 2//9(.) -f Xndivu -f fxn x curl{.) and in its weak form by 
the first Green formula. For S{x,y) we now have to insert the well known 
Kelvin fundamental solution (see, e.g., [6]). The variational formulation (5.2) 
has a unique solution provided that the single layer potential operators Vi 
are positive definite (H“^/^(9i7i)-elliptic) [6, 32, 40]. 

Similar to Section 3.2 we discretize (5.2) to obtain a system of equations 
which is manipulated in the same way as discussed in Section 3.3.1. This leads 
again to a symmetric, positive definite system matrix. Thus, the parallel solu- 
tion can be performed in a similar fashion provided suitable preconditioners 
Ca, Cc, Ci and Bj are known (see Section 5.2 and [40]). 

5.2 Numerical Results 




Fig, 5.1. The subdomains and the BE discretization of the level (left) and the 
deformed (magnification factor 100) FE grid of the 2"^ level (right). 
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Table 5.1. Levels (/), pure solution time (s), number of unknowns (AT), iteration 
count (/(e), e = 10“^), CPU time in seconds for the dam-problem. The 
experiments were carried out on a Power-XPlorer using 8 processors. 





BEM: 


I-VHI 


FBM: 

BEM: 


III-VHI 

I-H 


FEM: 


I-VHI 


1 


1(e) 


CPU 


1(e) 


CPU 


1(e) 


CPU 


2 


27 


4.9 


25 


4.6 


25 


3.9 


3 


27 


8.8 


28 


8.8 


28 


7.9 


4 


28 


24.1 


30 


27.9 


31 


22.8 


5 


29 


85.9 


31 


104.9 


32 


81.3 


s(5) 


49.5 


68.5 


63.8 


N(5) 


6470 


78130 


119318 



As a test problem we consider a model of dam filled with water as sketched in 
Figure 5.1. As indicated there, boundary conditions are given on Fd (zero dis- 
placement) and on Fn (zero or according to the water pressure, respectively). 
The Lame constants are given for rock (I-II) by = 7.265e4MPa, A,. = 
3.743e4MPa and for concrete (III- VIII) by — 9.2e6MPa, = 9.2e6MPa. 

For the results presented in Table 5.1, the operators Ca-, Cc, C'j, Bj have 
been chosen as follows: 

Caa : mgV {i els) Cc • mgD 

: Vll(HExtd-S) {i G If) : HExt+S {i G Xf). 

Here, (mgV) stands for a multigrid-based preconditioner for the single layer 
potential (see [1]). Furthermore, new algorithms for Ci^i and Bj i have been 
used. In particular, the hierarchical extension (HExt-fS) has been improved 
by a coarse grid solver and smoothing on the other levels [13]. 

The BE discretization of the 1^^ level and the FE discretization of the 2'^^ 
level (deformed mesh) are shown in Figure 5.1. We use piecewise linear trial 
functions for the displacements and piecewise constants for the boundary 
tractions. The entries of the BE matrices are computed fully analytically. 

In Table 5.1 we present several combinations of FE/BE discretizations. 
Looking at the CPU-time we observe that the FE discretization (column 3) 
leads to the best results. However, if we are interested in the pure solving 
time s{.) ( 5 ( 5 ) for the 5^^ level is given in Table 5.1) the BE discretization 
(first column) is of advantage. 



6 Concluding Remarks and Generalizations 

The DD-method hats turned out to be a powerful tool for establishing the 
coupled FE/BE variational formulation and for solving the discrete systems 
efficiently on massively parallel computers. The results presented here have 
been obtained using the code FEMQDBEM [14] which can solve linear and 
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non-linear magnetic field problems as well as problems arising in linear elas- 
ticity. The high efficiency and the scalability of the algorithm has been demon- 
strated [5, 16, 30, 26, 28, 36]. 

Comparison of ” local” DD methods described in this paper with ’’glob- 
al” multigrid methods implemented on massively parallel machines as well 
as workstation clusters is given in [28]. The use and the parallelization of 
” global “ methods is also discussed in [4]. Other coupling and solution tech- 
niques are studied in [10, 41, 43]. 

The techniques presented here can be generalized to the 3D case provided 
that fast matrix-by- vector multiplication routines for the BE matrices, e.g. 
based on Panel-Clustering-Techniques developed in [22], and asymptotically 
optimal, or almost optimal components (7/ (e.g. multigrid preconditioners), 
Bj [38], Cc (e.g. BPX) and Ca [35, 39, 40] of the preconditioners are available. 
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B The Parallel Algorithm 



FEM {i € If) BEM {i e Ib) 



Starting Step 



Choose an initial guess u = uo 
Ui = [uc,i I Ui = [uA,i 



r/,i — VA,i — ~l" KAC,i^Cji 

TA,i = C^]^^A,i 



rc,i = fc,» — Kc,iuc,i — Kc/,iU/,» rc,i = fc,» — Kc,»uc,i KcA,iUA,i 

rc,» — rc,» — KcA,iVA,i 

vc,i = rc,» - Kci,iBYjri^i wa,» = r^,*; p» = VA,i 

^C,i — ^C,i) ZA,i — KA,jWA,i 

Wc,i = ^C7,iWc 4- WC = -> Wc,i = ^C,iWc 



w/,j = Cj -ri,i - BjjKic,ivrc,i 


ZAC,i = i 


s = w 


s — w 


= Tc.jWc.i + rf,jW/,j 


II 

b 


II 

O 





Iteration 



1. v/,i = Kic,%sc,i H- 

V(7,i = Kc,isc,i 4- Kci,iSi^i 
Si = vjsc + 



WA,i — ZA,i ~ ZAC,i 

Vyi,i = 

Vc,i = Zc,i H- KcA,i(sA,i ~ Vyl,^) 

+ ^A,i^A,i - ^A,i^A,i 



a = a/S ^ ^ “ EiLi ^ oc — a/S 



— av 



vc,i = rc,i — K(7/,»Hfff/,i 


^C,i — ^Cfi] h^ji — KAjiTAji 


wc,i = Ac,iwc i-wc = C^^] 


CLi ^ wc,i - i4c,iwc7 


w/,i = Cyjri^i — BylKic,i^c,i 


WA,i = P» — P» OrWA,i 

hAC,i = KAC,iWc,i; hc,i = Kc,iW(7,i 



= rS,iWc,i + r/,iW/,i 



= w» 4- /3s< 



(Ji = r5,iWc,i + - pfr^, 



^ ^ = /5 = a/a 



= Wi H- /3si; Zi = hi 4- /3z 



If ^- < ♦ (T®, then STOP 

else goto step 1. 
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On the Use of a BEM Time-Stepping 
Procedure for Nonlinear and Unsteady 
Wave-Structure Interaction 

0. Mahrenholtz, V. Schlegel, and C. Haack 



1 Introduction 

Computer aided analysis of fluid-structure interaction in ocean engineering 
problems requires efficient and reliable numerical methods. Due to free surface 
flow and gravity waves they have to consider: large and arbitrary shaped do- 
mains, free and moving surfaces, i.e. nonlinear and time-dependent boundary 
conditions. Assuming a two-dimensional potential flow, the direct boundary 
element formulation (BEM) is the most efficient solution method. Compared 
to other discretization methods it has minimum discretization expenses, a 
better representation of complex boundary geometries, and a simple approach 
to moving boundaries. This is achieved by coupling the BEM with a time- 
stepping procedure. For numerical simulation of time-dependent, nonlinear 
(overturning) gravity waves this approach, introduced by Longuet-Higgins 
and Cokelet [10], now is an optimized and reliable method in use, [6, 7, 12, 14]. 
Recently even a three-dimensional wave analysis is proposed: [1, 3, 15, 20]. 
The extension of this BEM time-stepping approach to wave-structure inter- 
action is reasonable, [19], especially to wave channel simulations, [6, 18]. A 
direct BEM formulation allows for different kind of boundary conditions on 
each finite boundary element. Hence, this proposal pay regard to a wide range 
of different wave-structure interaction problems, ref. Fig. 1.1. 
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Fig. 1.1. Wave-Structure interaction in wave tank applications 
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In this contribution some special aspects of BEM time-stepping proce- 
dures for the analysis of nonlinear and unsteady wave-structure interaction 
are discussed: 

- space and time discretization (numerical stability, time-step control), 

- fluid particles inside the domain, 

- absorbing boundary condition, 

- pressure formulations for freely and constrained floating structures. 

The proposed numerical wave tank simulation obviously ask for proof of 
modeling and accuracy by wave tank experiments (typical laboratory facili- 
ties for the analysis of wave-structure interaction). Therefore, the aforemen- 
tioned aspects are analyzed by direct comparison of experiment and BEM 
computation. 



2 Wave-Structure Interaction 

The interaction of gravity waves with flxed or floating structures is consid- 
erably complex, see Fig. 2.1. Within the time [ti, ^ 2 ] the geometry of the 
boundary P and the submerged surface of the structure is completely chang- 
ing. 

This is described by the kinematic and dynamic free surface condition and 
the equations of motion of structures, respectively. In case of closed domains 
(tanks, containers etc.) also a change of the mean water level (SWL) has to 
be considered. 




Fig. 2.1. Time-dependent wave-structure interaction 
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2.1 Free Surface Flow 



If an irrotational flow and an incompressible fluid are assumed, the flow 
field w can be expressed by the gradient of a scalar velocity potential </>, i.e. 
w = V 0. The conservation of mass now is described by Laplace’s equation 

divw = div V (f) = = 0 , (2.1) 

and the conservation of momentum reduces to the Euler equation (inviscid 
flow) 

e ^ ^ ■*' w) = - Vp + k , (2.2) 

with the fluid density q, the pressure p and the specific volume forces k. The 
integration of equation (2.2) with respect to spatial coordinates gives the 
scalar Bernoulli equation. For a point P, which is arbitrarily moved in the 
flow, it becomes the form 

dt 

d<t>/dt 



w 



p Q 



gyp - 



+ V^w, 



(2.3) 



where g is the gravitational acceleration and y gives the vertical displace- 
ment. The nonlinear free surface flow is described by the kinematic and dy- 
namic boundary condition of a fluid particle F on the free surface. From a 
Lagrangian formulation the velocity vp of the fluid particle is obtained as 
(kinematic free surface condition) 

^ ^ = w = V , (2.4) 

where rp is the position vector of the fluid particle. With vp w, and with 
the pressure at any position of the free surface is equal to the atmospheric 
pressure Pamb> equation (2.3) reads 



D4> 

Dt 




1 

Pamb - 9VF + 

Q 




(2.5) 



and gives the time-dependent and nonlinear dynamic boundary condition of 
free surface flow. 



2.2 Rigid Body Motion 

Within the framework of potential theory and the herewith derived linear 
wave theories floating body dynamics are normally given by linearized equa- 
tions of motion, [ll]:i.e. small amplitude motions of waves and structures. In 
case of time-dependent nonlinear wave-structure interaction a more general 
formulation has to be used. 




152 O. Mahrenholtz, V. Schlegel, C. Haack 

Therefore, the equations of motion of coupled or constrained floating 
structures are generated by the Newton-Euler formalism. Due to the cou- 
plings, the position vectors and the matrices of rotation of the considered 
bodies are no longer independent. The systems position now is described by 
(independent) generalized coordinates qi of the vector q. When introducing 
kinematical relations, the Newton-Euler equations are given by the applied 
Ql and constraint Q* forces and torques acting on the bodies 

M*q + li:* = Q: + Q: . (2.6) 

Here, M* = M J denotes the product of the block diagonal mass matrix 
M with the Jacobi matrix J of the system. All quantities independent of 
q are assigned to the vector of gyroscopic, centrifugal and Coriolis forces 
K*. The vector Ql describes the applied forces and torques, including hy- 
drodynamic interaction, while the vector Q* denotes the constraint forces 
of the couplings. D’Alembert’s principle is applied, in order to reduce the 
number of unknowns. Thus, the reduction of Newton-Euler equations (2.6) 
is achieved by left multiplication with the transposed Jacobi matrix J^, and 
the constraint forces Ql are vanishing. 

For an arbitrary point P (fixed, constrained or fi:*eely moving), see Fig. 
2.1, the pressure is obtained by rearranging Bernoulli’s equation (2.3) to the 
form 




Integrating the pressure field p (xg, t) on the submerged surface Tg of a struc- 
ture gives the hydrodynamic force (see Fig. 2.2) 





Fig. 2.2. Hydrodynamic pressure and forces 
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with the unit normal vector n(xs,t) directed out of the fluid domain. The 
position Xs of the submerged surface depends on the position Xfo(t) of the 
free surface (water sloshing, etc.), the position of the structure, and time. 
With respect to an arbitrary point Q, the resulting hydrodynamic torque 
reads 

-^h = J (xs) X (p (xs, i) n(xs) ) d 7 , (2.9) 

r, 

where rgr^Cxs) is the time-depenent distance vector from Q to the submerged 
surface Tg, ref. Fig. 2.2. In case of two-dimensional applications the form 

M^=rQAxFh (2.10) 

is more convenient. Here, the hydrodynamic forces are directly applied; how- 
ever, the unknown time-dependent distance vector tqa, connecting the two 
points Q and A, now has to be described. The equations (2.7)-(2.10) provide 
all relevant quantities of a hydrodynamic analysis. 



3 BEM Time- Stepping Technique 

The dynamics of wave-structure interaction is treated as an initial value prob- 
lem given in state-space representation by the following system of coupled, 
nonlinear ordinary diflFerential equations (ODEs) 





' Xi=q 




■ X2 
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Due to the pressure formulation (2.7), the vector of generalized forces 
depends on </> and 6(j)/dt. Therefore, the numerical solution procedure of wave- 
structure interaction has to consider both: the velocity potential (j) and its 
time derivative. 

By means of a BEM approach, see e.g. [2], Laplace’s equation (2.1) is 
transformed to a boundary integral equation (BIE) 

C{i) m = f Wn(x) G(x,0 - </>(x) d r(x) , (3.2) 

J on 

r(t) 

where r{t) is the boundary of the considered fluid domain, (7 is a geometry- 
dependent factor, G the fundamental solution of the problem, (j) the velocity 
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potential, Wn = d(j>/dn its normal derivative and x, ^ denote the position 
vectors for field and source points, respectively. This integral equation is dis- 
cretized by NE finite boundary elements (Fig. 3.1) with the approximations 
(j) and Wn and results in 

r £ \ NE « 

Cim) + E/ dn ^ y Wn(x)G(x,^,) dr, (3.3) 

1 p. p. 

^3 ^3 

with the nodes of the boundary discretization. 




Fig. 3.1. Fluid domain i?, boundary F, and boundary conditions (j) and d(t>/dn 



The main advantage of this approach is that on each boundary element 
either Dirichlet (</>) or Neumann (d(j)/dn) data can be given, see Fig. 3.1. 
Hence, it can be applied to a wide range of different problems, depending on 
the boundary conditions only. The approximation of the boundary (geometry 
and conditions) depends on the order of the Ansatz-functions or the type of 
boundary elements, respectively. For BEM simulations of nonlinear wave- 
structure interaction especially 

— cubic spline-elements on the free surface and 

— quadratic elements on all the other boundaries 

show best results, [6], [8]. Depending on these element formulations the given 
boundary data (0(x), x £ Fi and Wn{x), x G F 2 ) is interpolated and the 
unknown boundary data {^{x), x £ F 2 and Wn{x), x £ Fi) is approximated. 
The BEM approach (3.3) now results in the more convenient matrix formu- 
lation 



[(c + h) -g’ 


<P 


= [G - (C + H)] 


’ Wn ' 


L \ / 


_ Wn _ 







Ap 
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where the matrix of coefficients Ar typically is a dense and unsymmetric 
matrix. In this BEM proposal geometrical singularities of corners are treated 
by a double-node concept. The use of spline-elements asks for additional 
boundary data with respect to the local coordinate rj of the finite boundary 
elements 





d(p 


T 


dWn 


= 


r 


and Wn — 


Of] 



This allows with the Jacobian matrix J direct computation of the kinematic 
boundary condition in (2.4) or (3.1), respectively 



^ = w = V(^(rF,t) 



d^jdx 


= J-* 


d(j)l dr] 


d(j>ldy 




d(j)/dn 



(3.6) 



Moreover, an additional smoothing (see e.g. [20]) is not necessary anymore. 
If using the double-node concept and spline-element formulations, a least- 
squares solution technique of the BIE (3.2) is reasonable 

A^^QAry = A^;.Qb. (3.7) 



All additional constraints are considered in the least-squares sense within 
the BEM solution procedure, with a now dense but symmetric and positive 
definite matrix of coefficients A^^QAr. Here, Q is a diagonal matrix of the 
integration weights multiplied by Jacobian’s determinant. 



3.1 Pressure Formulation 

By means of the BEM solution and the state-space representation (3.1) all 
quantities of the pressure equation (2.7) are given now, except the time- 
derivative of the potential </>. Obviously there are two ways to describe the 
hydrodynamic pressure: 

— to give an finite difference approximation of d^/dt or 

— to solve a BIE for d(j)/dt = 0 (though there is no physical interpreta- 
tion). 

Latter depends on the application. Besides (j) and d(f)/dn, additional boundary 
conditions d(t>/dt and d^<t)/dn dt are now required. In case of fixed structures 
[17], or linear dependencies this is possible [4, 19]. Considering nonlinear 
motion, it becomes most difficult, [5]. The additional BIE now depends on 
the problem under consideration. 

Therefore, the more general finite difference approximation is used here: 

d</>(rp (tj), tj) ^ <t> (rp {tj + Atj), tj + Atj) - (j) (rp (tj), tj) 
dt Ati 



(3.8) 
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This is easily implemented into a higher order time-stepping scheme, e.g. a 
4th order Runge-Kutta method with equidistant time-levels (Newton’s 3/8- 
rule). Moreover, equation (3.8) is independent of the problem and gives a local 
formulation. In the context of BEM: required data refers to single nodes on 
the boundary and is given directly by the BEM solution of (3.4). Once the 
hydrodynamic pressure is known, the hydrodynamic forces are computed and 
refering (3.1) they give the rigid body dynamics, i.e. 

V‘^<t>{t) => (j){t) p{rp,t) Fh(xs,t) ^q, q, q. (3.9) 

Using basic kinematical relations the velocity of an arbitrary point P on the 
structure, ref. Fig. 2.1, is given in generalized coordinates q by: 

[vp(q,q,t)]j = [re (q,q,t)]j + w(q,q,t) X [«(q,f)]ic [pp]c , (3.10) 

with the position vector rc of the center of mass, the position vector pp of 
the point P in a body-fixed frame, the vector of angular velocities u;, and 
the matrix of rotation 17. The boundary condition on the submerged body 
surfaces now reads 

wn = = - [vp (q,q.t)]i p ■ (3.11) 



3.2 Space and Time Discretization 



The solution of the proposed BEM time-stepping procedure depends on the 
order of approximation and the discretization in space and time. In order 
to improve accuracy and avoid (saw tooth) instabilities at the free surface 
higher order approximations are used, [6]: 

— cubic spline-elements (BEM) and 

— an explicit 4th order Runge-Kutta scheme. 



Because the BEM is part of the operator of the initial value problem (3.1), 
the total error of the BEM domain solution has to be less than the total 
error of the Runge-Kutta method. In case of a smaller time-step, the space 
discretization has to be adapted, i.e. the number of boundary elements has 
to increase. Hence, the number of operations and CPU-time does not depend 
linearly on size and number of time-steps. For many practical applications a 
fraction Ax/ At < 5 of space/time discretization shows reliable results. 

Another criterion that typically is used for these kind of problems, e.g. 
[13, 20], is the condition 



CFL = 



dr(0 At ^ ^ 
d t Ax{t) ~ 



(3.12) 



of Courant-Friedrichs-Lewy (CFL). Thus, a node ^(t) of the moving boundary 
r{t) is allowed to move at most the distance Ax within the time At. During 
the numerical simulation the quantities Ax/ At and CFL are computed on 
the critical, i.e. moving boundaries. They serve as a criterion for numerical 
stability as well as for time-step control. 
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4 Numerical Wave Tank Simulation 

As mentioned typical laboratory facilities for wave-structure analysis are wave 
tanks. The main aspects of the proposed numerical method for the time- 
domain analysis of wave-structure interaction are directly compared with 
experimental data of the wave flume at hand: length t = 12.3 m, width 
b = 1.5 m, and water depth d = Im. It is operated by a flap-type wave 
generator with or without a wave absorber at the end of the tank, ref. Fig. 
1.1. The experimental set up gives e.g. wave elevations at different positions 
in the flume and the hydrodynamic pressure on the wave generator. 

4.1 Space Discretization and Time-Step Control 

The local representation of the time-dependent free surface wave elevation is 
depicted in Fig. 4.1. The solid line gives the experimental results of a wave 
probe and the symbols show the numerical results of the BEM time-stepping 
procedure, using 20 or 80 boundary elements on the free surface. Obviously 
the resolution of the gravity waves becomes more accurate the more elements 
are used. This test (both experiment and BEM) is carried out without any 
wave absorber in order to give the boundary conditions exactly at the end 
of the wave tank (no flux: d(j)/dn = 0). The time-stepping method uses a 
constant time-step At « 10““^ s, i.e. the real time simulation of « 10 s has 
exploited the BEM in the order of 10^ times, without any instabilities. 




Fig. 4.1. Free surface wave elevation; comparison of experiment and BEM 

computation 



While the results of Fig. 4.1 are achieved with a constant time-step. Fig. 
4.2 compares the results with and without time-step control. Here the wave 
elevation is given for another test case. Reference (solid line) is the BEM 
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time-stepping procedure with At = 0.03 s. The time-step is controlled by the 
CFL condition (3.12) and starts with At = 0.12 s. The wave elevation with 
time-step control shows some oscillation for 6 s < t < 8 s but still without 
instability, ref. Fig. 4.2. During computation the step size is changed, de- 
pending on CFL. Fig. 4.3 shows that CFL is larger with time-step control 
(dashed lines) compared to CFL with fixed time-step At = 0.03 s (solid lines). 
As stability or control criterion CFL < 0.7 shows best results. Besides giving 
a reliable stability criterion, this enables a CPU-time reduction (speed-up 
2 3). Space discretization is fixed to the finest grid. 




t 



Fig. 4.2. Free surface wave elevation; computations with (At = /(CFL, t)) and 
without (At = const) time-step control 




Fig. 4.3. Time histories of time-step and CFL condition with (dashed lines: 
At = /(CFL, t)) and without (solid lines: At = const) step-size control 
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4.2 Trajectories of Fluid Particles Inside the Domain 



The Lagrangian formulation (2.4) of the free surface kinematic boundary 
condition enables a simple approach for particle tracing on the free surface. 
A detailed analysis of fluid particle dynamics near fixed or floating structures 
is given by [9]. In some cases, [17], among free surface flow the fluid domain 
is of interest also — measurement becomes here most difficult. The described 
BEM time-stepping analysis of wave-structure interactions allows for particle 
tracing inside the domain, see Fig. 2.1 

^end 

^fW=^f(*o)+ J wdt. (4.1) 

<0 



Once all boundary data is given on the boundary r{t) of the fluid domain 
n the flow field w = V<^ can be expressed at any point ^p. The BIE (3.2) 
now becomes 



d4>i$p) _ f d<t>{x) dG{x,i^) 

dxi J dn dxi 
r 



G{x,^p) 

dndxi 



(t>{x) dr(x), 



(4.2) 



with Xi the 
solution is 



cartesian coordinates. The gradient VG of the fundamental 



^ ^ ^ 

dxi ~ 7 T ( rTr )^“^/2 ’ 



(4.3) 



where dim denotes the problems dimension, and ri are the components of the 
distance vector r = x— ^p. The normal derivative of the gradient V(V^ G) n 
reads 



d‘^G _ dim n r^n Ui . 

dn dxi ~ 7T tt (r'^r)^”"/^ ’ ' 

with Hi components of the normal vector n. 

The basic result of particle tracing inside the domain shows Fig. 4.4 for 
the simulation of linear gravity waves. In this case particle dynamics is de- 
scribed by wave theories. Numerical simulations with the BEM time-stepping 
procedure pays regard to three significant items of linear wave theory, see Fig. 
4.4: 



- in case of deep water, with d/X > 1/2 , particle trajectories are circles with 
decreasing diameters for greater water depth, 

- down to the bottom of the wave tank an increasing horizontal movement 
in case of shallow water waves {d/X 1 ), 

- in all cases the vertical component of the fluid particle orbits is vanishing 
near the bottom. 



Note that Fig. 4.4 shows no circles due to different scaling of x, y-axes. 
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Fig. 4.4. Numerical results of particle-tracing; fluid particles inside linear gravity 

waves 



4.3 Absorbing Boundary Condition 

In order to avoid reflections linear wave tank analysis of wave-structure in- 
teraction makes use of wave absorbers at the end of the flume. Among many 
others, ref. [16], in case of a BEM time-stepping procedure a modified dy- 
namic free surface boundary condition (2.5) can be used 

D<i(rF, t) 1 IwP . . . . 

= Pamb - gyp + ^ MR^(rF, t) • (4.5) 

The added Rayleigh damping /xr, </>(rp, t) results in an irrotational, invis- 
cid dissipation of energy. Compared to other approaches the main advantage 
again is in context of BEM: on certain nodes of the boundary discretiza- 
tion the boundary data is modified. This is easily implemented in the BEM 
time-stepping method. 

Anyway, now the parameter /xr of Rayleigh damping has to be identified. 
Best result is achieved with a linearly increasing distribution (up to /.tRmax) 
on the damping zone, ref. Fig. 2.1. If the length £r of the damping zone is 
adapted to the wave length A an optimal parameter value //Rmax = 4 s“^ 
is found for a wide range of different applications (wave height, steepness, 
frequency) . 

Fig. 4.5 gives the damping of a wave group. Here the wave elevations 
at different positions in the fiume are depicted. Experimental and numerical 
BEM data show very good agreement at these points. Hence, the Rayleigh 
damping boundary condition (4.5) pays regard for the absorption of the wave 
group in the wave tank. In addition Fig. 4.5 shows the numerical results in 
case of no artificial damping, i.e. no fiux condition at the end of the flume. 
Large reflections are evident. 
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4.4 Pressure Computation 

The main aspect of wave-structure interaction is the hydrodynamic pressure 
in case of large amplitude gravity waves. This gives the dynamics of floating, 
and the loads on flxed or moved structures. The basic example of wave- 
structure interaction in wave tank experiments is the wave generator problem. 

In order to proof modeling and accuracy of the proposed pressure compu- 
tation the BEM time-stepping results are compared with experimental pres- 
sure data at hand. Fig. 4.6 shows the hydrodynamic pressure pdyn (without 
the hydrostatic component) at different positions D1 . . . D5 on the flap type 
wave generator. The positions (depth) with regard to SWL are: D1=0 mm , 
D2=25 mm , D3=75 mm , D4=175 mm , D5=375 mm. Boundary conditions 
on the wave generator are given as input for the BEM by means of additional 
measurement data (position and acceleration). Fig. 4.6 depicts the hydrody- 
namic pressure Pdyn- Solid lines give the results of experiments, and symbols 
the results of BEM computations, respectively. 

The direct comparison of experiment and BEM computation shows that 
the proposed numerical methods gives wave-structure interaction in detail: 

- shocks during starting procedures, 

— large amplitudes of the hydrodynamic pressure due to sloshing effects, 

- emerging of pressure transducers (Dl), 

— negative hydrodynamic components in greater water depth. 

5 Constrained Floating Bodies 

The interaction of gravity waves with fixed or floating structures is a highly 
developed subject and many mathematical models are available in case of 
small amplitudes, [11]. For either large amplitude waves or large or con- 
strained motions of structures most of the commonly used models failed due 
to their restrictions. A typical example of freely and constrained floating 
bodies is depicted in Fig. 5.1. 

Time-domain simulations of floating structures in free surface flow by 
means of equations (3.1) and (3.4) allow a more detailed analysis of these 
considerably complex wave-structure interactions. The described procedure 
can generally be applied to a wide range of different problems. Besides, the 
hydrodynamic loads acting on the structures, this formulation gives e.g. the 
moving boundaries, the velocity potential, and the flow field even in the case 
of large amplitude motions. 

Fig. 5.2 shows the initial discretization of the aforementioned exam- 
ple in the area of the floating structures (Body 2, Body 3 ref. Fig. 5.1). 
The time evolution of the moving boundaries (free surfaces and submerged 
structures) is given in Fig. 5.3. Time histories of generalized coordinates 
q ^ 2 (^)]^ 2 tre depicted in Fig. 5.4. With these results the position 

and velocity of arbitrary points on the rigid bodies (e.g. center of mass) can 
also be given by (time-dependent and nonlinear) kinematical relations. 
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Fig. 4.6. Hydrodynamic pressure on a flap type wave generator; comparison of 
experiment (solid lines) and BEM computation (symbols) at the positions 

D1 . . . D5 



6 Conclusions 

A boundary element formulation for the analysis of fluid-structure interac- 
tion problems considering nonlinear gravity waves is presented. Because all 
quantities of interest are located on the boundary itself and regarding recent 
developments in boundary element methods (BEM) this is a reliable, most 
efficient approach with minimal discretization expenses. The major advantage 









Wave-Structure Interaction 



165 




Fig. 5.4. Time histories of generalized coordinates q(t) 

of a BEM formulation is the direct numerical implementation of the equa- 
tions of motion in an explicit form: for fluid particles on the free surface and 
for freely or constrained floating bodies. The coupling with a time-stepping 
procedure allows even for time-dependent and nonlinear conditions. 

In particular this approach provides the extension to a numerical wave 
channel concept. The proposed method is applied to numerical simulations of 
wave tank experiments. Besides a direct comparison with the BEM compu- 
tation, measurement data provide a more detailed analysis of the numerical 
scheme, e.g. with regard to space and time discretization, order of discretiza- 
tion, types of boundary elements, CPU-time requirement, stability criterions 
or pressure formulations. 

A criterion is given that allows for time-step control of a higher order 
Runge-Kutta method, and pays regard to BEM discretization in case of mov- 
ing boundaries. The BEM time-stepping procedure is a simple approach to 
trajectories of fluid particles on the free surface and inside the domain. Par- 
ticle tracing shows good agreement with basic analytical results of linear 
wave theory. The presented pressure formulation is independent of the form 
of application. The computed hydrodynamic pressure takes hold of time- 
dependent, large amplitude motion (water sloshing). The main features of 
this approach for freely and constrained floating body (multi-body) dynam- 
ics are presented. In context of BEM: the analysis takes full advantage of 
physical data at discrete points (nodes) of the boundary only. 
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Integral Equations arising from Instationary 
Flows around Thin Wings 

R. Hinder and E. Meister 



1 Introduction 

Within the scope of a three-dimensional linear theory we analyse a mixed 
screen boundary value problem for the Helmholtz equation {A -h = 0 
where ^ is a perturbation velocity potential, generated by the presence of an 
oscillating wing in a basic flow. 

Not all Cauchy data for ^ are given explicitly on the two sides of the 
screen. The missing Cauchy data depend on the wing circulation F and must 
be evaluated by the Kutta condition: is flnite near the trailing edge Xf of 

the wing. Using the boundary integral method we transform this mixed screen 
boundary value problem into the hypersingular boundary integral equation 
= h{F), where Dj^ denotes the normal derivative of the double layer 
potential with respect to the projection L of the lifting surface on the screen, 
the unknown density ^ and a right-hand side h depending on the unknown 
wing circulation F. To fulfil the Kutta condition in such a way that all ap- 
pearing terms can be defined mathematically exactly and belong to spaces 
which are physically meaningful, we propose to fix F by the condition of 
vanishing stress intensity factors of ^ near Xf up to a certain order such that 
^^\xf ^ W 2 {xi) C L 2 {xf) for appropriate cr > 0. 

In the two-dimensional case, and if L is the left half-plane in R^, we have 
an explicit formula to calculate F and we can control the regularity of F and 

In the case of a polygonal wing projection L we propose a strategy to 
calculate F by minimizing the stress intensity factors near Xf in the sense of 
least squares. 



2 The problem 

Our aim is to make a contribution to answer the question why aeroplanes can 
fly. Of course, one answer to this question is, that aeroplanes fly on account 
of a higher pressure on the lowerside than on the upperside of the wings of 
the aeroplane. But to give more precise answer to this question one needs 
a detailed knowledge of the distribution of the pressure, respectively of the 
velocity, at the sides of the wings. 
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One main aim for a mathematician working on an engineering problem 
is to prove the convergence of numerical algorithms for the mathematical 
problems. But in order to do this, one needs a detailed knowledge about 
solution spaces and their corresponding norms, i.e. one needs an existence, - 
uniqueness- and regularity theory for the specific problem. But such a strict 
mathematical theory is, up to now, not available for lifting surfaces, especially 
in the full three-dimensional situation. One even cannot define fundamental 
quantities appearing in the models without mathematical ambiguity. 

Of course, a lot of models for calculating the lift, which have been realized 
on computers, can be found in the literature using modern numerical tech- 
niques, e.g. the evaluation of singular and hyper singular integrals. But the 
important fundamental problem of the correct solution spaces has not been 
solved. 

Hence, one needs particularly existence,- uniqueness- and regularity re- 
sults in such a way that the appearing quantities can be defined exactly and 
belong to physically reasonable spaces. 

This philosophy is the starting point for our investigations. Of course, in 
these first attempts we have to admit a lot of idealizations, which are not 
approved from the practical point of view. 

We now formulate under certain idealizing assumptions within the scope 
of a linear three-dimensional theory a model for the calculation of a pertur- 
bation velocity potential generated by the presence of an oscillating wing 
in a basic subsonic fiow, which seems to be suitable for proving results like 
those mentioned above. 

Starting point for our investigations is a fourthysix years old paper by 
Reissner [18], who formulated under similarly idealizing assumptions condi- 
tions which have to be fulfilled by These conditions have been reformulated 
by Meister [13] in the two-dimensional case as a more handable model and 
based on his ideas by Hebeker [8] in the three-dimensional situation. This 
model, which we shall recall shortly, has been studied in more details in the 
papers [9] and [11]. 

We assume that the gas fiows with a constant subsonic velocity in the pos- 
itive x-direction and is inviscid, barotropic, compressible, and weakly damp- 
ing. Furthermore, we assume that the velocity of the fiowing gas is small, 
compared with the speed of sound. 

We assume as an obstacle a thin, weakly cambered, time harmonically 
oscillating wing into the region of the fiowing gas which is divided into an 
upper and a lower part, respectively, by the (x,y)-plane. 

Note, that the vorticity theorem of Helmholtz implies the existence of a 
region of vortices behind the wing, the wake, which we have to admit in the 
model. 

On account of our assumptions (small velocity, thin and weakly cambered 
wing), the normal component of the velocity is prescribed on the wingsurfaces 
by its form ([1], [18] and others). Therefore, one can represent the vertical 
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component of the velocity on the upper and lower side of the wing as functions 
on the projection L of the wing onto the (x,y)-plane. 

Now one postulates within the scope of a linearized theory the existence 
of a perturbation velocity potential induced by the presence of the wing, 
such that 

Total potential = perturbation potential ^ -h potential without wing, 

V" ' " ^ 

” small" 

for which one calculates the boundary conditions linearized in the (x,y)-plane 
[1, 18]. 

The functions, obtained from the requirement of a total vanishing normal 
velocity at the wingsurfaces, can be split into a symmetric and an antisym- 
metric part. It is known, that the symmetric part gives no contributions to the 
lift, so that we can ctssume a given Neumann data g = ^ on the projection 
L of the wings onto the (x,y)-plane [1, p. 210]. 

We agree upon the following notations and assumptions: 

L denotes the projection of the wing 1 
W denotes the projection of the region > onto the (x,y)-plane. 
of shed vortices J 

We assume: 

— L is a bounded, simply connected Lipschitz domain. 

- The two straight lines bounding the strip I touch the boundary dL of L 
exactly in one point, the tips of the lifting surface. After removing these 
two points from dL, it is divided into a leading edge xj and a trailing edge 
Xf, which we assume to be parametrisized by = xi(y),xf — Xf{y),y E 

Every straight line in the strip I, and parallel to the x-axis, 
is assumed to intersect x^ and Xf in exactly one point [2]. 




In the following we denote by S the open screen L U x^ U W where W := 
{{x,y) : y e {ymin^ymax),x > x^(j/)}. 
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Finally, we agree that the normal vector n on 5 is the outer normal of 
{{x^y^z) £ : ^ > 0} and that the potential jump on S is the 

“limit from to 5 minus the limit from R^ to 5” and denote this limit 
with [^], respectively, . 

Now we can formulate the problem of calculating the perturbation velocity 
potential ^ explicitly. 

The perturbation potential ^ (induced by a time harmonically oscillating 
wing) has to fulfil — after a reductionprocedure — the Helmholtz equation 

(A -h = 0 in R^\5, Re A: > 0 , Im A: > 0 (2.1) 



where k denotes the reduced wavenumber of the oscillating wing ([1], [13]). 
Re A:, Im A: > 0 hold on account of our assumption of a weakly damping gas. 

The normal derivative ^ on L is determined by the assumption of a 
vanishing normal component of the total velocity at the wings: 



•= 9{x^y) on L. 



It is known ([1], [18]), that 



and (2.2) implies 






dn 



dn 



= 0onW 



= 0 on L. 



( 2 . 2 ) 



(2.3) 



(2.4) 



In the case of a time harmonically oscillating wing the perturbation pres- 
sure ps and the perturbation velocity potential ^ are connected by 

Ps {x,y,z)=c{x, y) (^ - i-^) ^{x,y,z) (2.5) 

with a continuous function c and the Machnumber M (here: M < 1). 

It is well-known that the wake cannot carry a jump of the pressure: 



\ps] = 0 on IF. 



( 2 . 6 ) 



Furthermore, we assume a condition of finite energy: 

^£W^(bP\S), (2.7) 

where denotes the usual Sobolev-Slobodeckij space. 

Finally, ^ has to fulfil the Kutta condition in a sense prescribed precisely 
in latter paragraphs. Classically, (e.g. [18, p. 826]) the Kutta condition is 
usually formulated in the form: 
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The perturbation velocity remains finite by an ^ 
(three-dimensional) approach > . (2.8) 

to the trailing edge of the wing projection L J 

Following [13], [8] we reformulate some of these conditions in form of a 
more suitable model. 

From (2.5) und (2.6) we obtain 

0 = \Ps]w = <-^Ww-i-^mw). 

i.e. (x,y) = aiy)exp 

a{y) can be expressed by the wing circulation F. Let 

^ (x, y) := [^] I (x, y) on L (2.9) 

denote the unknown jump across L. For ^ (R^\5) the jump [^]g 

across S is definable in a senseful way, but is in general not a continuous 
function in a neighbourhood of Xf. Our definition of the Kutta condition will 
rise the regularity of ^ in fl U and R^ D U with an open subset U C R^, 
xi CU, such that [^] is continuous on Xf. Hence, we obtain with = 0 on 
the leading edge 



H(y) 

/ k \ f dW 

^(xt{y),y) = a{y)exp = / -^^x =: F{y), i.e. 

^liy) 

[^]w (^^y) = ^(y)^^p ( 2 - 10 ) 

Collecting these facts we can formulate the complete problem as follows: 
Find # e VFj(R3\5) with 

(A + = 0 in R^\iS , Re A: > 0 and Im A: > 0 

mi ^ 

[^]w = r{y)exp(i^ix-xtiy))^ 






L 



^ = 9onL 

|V^| < 00 on (Kutta condition) 



} ( 2 . 11 ) 
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= 0 



In (2.11) Qj fc, M and are given, F and, of course, ^ are unknown. 
Thus, our problem is a mixed screen - boundary - value - problem 
with partly unknown Cauchy data. 



3 Mathematical ansatz 

In this section we reduce problem (2.11) to the calculation of the wing cir- 
culation T, where we cannot apply the high standard of accuracy involved 
usually in this paper by treating difficulties coming up near the tips of the 
wing. Concretely, we transform (2.11) into the hypersingular integral equa- 
tion (3.14) on L. Then, we formulate regularity conditions for T, respectively 
for functions depending on T, such that constructed by using the solu- 
tion of the boundary integral equation (3.14), solve (2.11) with (possible) 
exception of the Kutta condition, i.e. (2.11) is reduced to the calculation of 

r. 

During the last years a lot of papers concerning screen problems with 
smooth and bounded edges have been published (i.e. [19], [20]). Contrary to 
these papers, our screens L and S are not as such smooth. Furthermore, the 
boundary dS of S is not bounded. But combining results of Costabel [3] who 
investigated elliptic equations on Lipschitz domains and of v. Petersdorff [15] 
who considered the Helmholtz equation under certain boundary conditions 
also for domains with unbounded boundaries and Stephan [19] who treated 
the Helmholtz equation in the exterior of a smooth and bounded screen un- 
der Dirichlet- and Neumann conditions we can formulate results of potential 
theory adapted to our situation which we present below (the proofs can be 
found in [9], [11]; There we took advantage that our screens L and S are plane 
and that S is with the possible exception of a bounded set a rectangle). 
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In order to formulate these results we recall briefly the definition of some 
well-known spaces. In the following we denote by W|(/2) for a domain f? C 
R” the usual Sobolev-Slobodeckij spaces and by H\n) the Bessel potential 
spaces (e.g. [22]). Here the Fourier-transformed ^ of a tempered distribution 
ip is normed by ip{u) ;= f <p{r})exp{i <t],u >)dr}. Furthermore, we need the 

spaces := {ip G : supp ip C O} with the //®-norm of the 

extension by 0 as norm. 

For functions 

^ € //I (r|) := [ip G hHrI) : II^H^1^(r3 ) 

||(^||2 . . 1 wtA I l 2 \_ii 2 ^ 



„1(r|)+II(A + *W^^(j^|)<cc) 



we can define the normal derivative := ^ as usual (e.g. [3], [19], [9], [11]) 
by 

71 : //),(R|) — 1 i/-l/2(R2) >1 

V 23 (/«<7HP./>^^(r2,) I 



71^ : h1/2(j^ ^ C 

< 7l <^,/ ^ f 7 q f ■ + k^)^dV 

”-± 

± / VpVjQfdV Tk^ J (PIQ fdV 



Here 7 q : i/^/^(R^) — > ff^(R^) denotes a right-inverse of the trace oper- 
ator 7 q. 

We need the following definitions of the potential jumps: 

Definition 3 . 1 . For ip ^ L2(R^) with ^ L3 € i/p(R^) and 

1 Q 

V>+ ■= f lj^3 € Hp(R^) the following jumps are well defined: 

Mr2 “ (toV’-) - (toV^-I-) 1 

K]r2 •= ]' 

The following theorem holds: 
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Theorem 3.1. (representation formula) 

Let M denote one of the screens L or S and let ^ ® weak 

solution of the Helmholtz equation in R^\M with Re k > 0 and Im k > 0. 
Then in R^\M the following representation formula holds: 



${u) = - j [<f] iO-^^G{u,Ods^+ J [1^ {OG{u,Ods^,u € r\m 

(3.4) 



M 



M 



with the fundamental solution 



-4w I It - ^ I 



(3.5) 



of the Helmholtz equation in the three-dimensional case. 
For the densities we have: 



[$] e hV2(m), 



dn 






(3.6) 



The proof is presented in [9], respectively [11], where we used ideas from 
[19] and results from [15]. The main idea is to extend M to R^ and to use 
the well-known representation of by a combination of the single layer and 
the double layer potential in the upper, respectively the lower half-plane of 
R^. Using that ^ is bounded away from M smoothly, the statements of the 
theorem follow. 

Finally, we need the following 



Lemma 3.1. 

Let M denote one of the screens L,W or S. Then the operators of the double 
layer potential 



> //1(r|), respectively, ' 

Km ■ hV2(m) — ^ 

<P - I <p(0-^G(u,Ods^ 

M s 



(3.7) 



and the operator ^‘normal derivative of the double layer potential” 

Dm ■ ^ 1 



are continuous. Furthermore, the following jump relations hold: 
bo^M‘P] = ‘fi’ = 0 /or G 



(3.9) 
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Remark 3.1. 

In the literature screen problems for smooth and bounded screens are usually 
treated by extending the screen to a smooth, closed manifold dQ such that 
dQ is the boundary of a smoothly bounded domain i? (e.g. Stephan [19]). In 
this case one is searching for solutions which fulfil the Sommerfeld radiation 
condition and one can admit real values for the reduced wave number k. 
Then, for (p £ one extends (p to l(p £ by zero and 

obtains {Kj^I(p)\q G H^{0) and Hence, 

one obtains solutions in the space ^(R^\M). 

Because the screen S is unbounciS, we have to require Re A: > 0 and 
Im A; > 0 to find an unique solution of the screen problem with S as screen 
in physically adequate spaces with a reasonable behaviour at infinity. We 
also used the idea outlined above: We have extended S to x {0}. O 
corresponds to the upper halfspace R^. (3.7) implies ^ 

and ^ Nevertheless, the image 

is not contained in //^(R^\M): The double layer potential “jumps” 
by passing through the manifold M. So one cannot expect the existence of 




if e Kj^ip e W"j(R3\M), because H^{R^) = and, as 

a solution of the homogeneous Helmholtz equation, Kj\^p is smooth away 
fi:om M. 



Under the assumption of a known circulation F we now derive a boundary 
integral equation for the unknown layer If' on L in order to find conditions 
for the calculation of F. 



Because 






JL 



_ \d^ 



iW 



0, the representation formula (3.4) implies 



for a solution ^ £ wi (R^\*5) of the problem (2.11) 



^{u) = - 




(3.10) 



Inserting the Cauchy data for [<f] in (3.10), we obtain after application of the 
normal derivative Ji 



- a;* (^ 2 ))) ■^G{u,Ods^. 
W s 



Passing to the limit u = (ui,U2,u^) — > (uj, U2, 0)= : u we obtain together 
with ^ = g on L and (3.9) for u G L the boundary integral equation 
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DiJP{u) = g{u) + ^ / m2)^xP - x^(^2))) 

'^W ^ 

(3.11) 

for 

Additionally, the boundary conditions 

tf^ = 0 on XI \ . . 

<? = r on xt j ^ ^ ^ 

have to hold. 

But it is well-known that boundary integral equations coming up from 
screen problems generate “zero boundary values” for the solution at the 
boundary of the screen. One has to expect ^ G (e.g. [19] or theorem 

3.1), which is not compatible with the condition ^ = F on the trailing edge 
Xf. The circulation F cannot vanish on Xf. Otherwise = 0 would hold. 

Therefore, following Hebeker [8], we introduce a smooth cut-off function 
X which is identical to 1 on Xf and 0 on xi with the exception of small 
neighbourhoods of the tips of L. The substitution 

$(ui,U2) :=^(ui,U2) - x{ui,U2)r{u2) (3.13) 

transforms (3.11) into the boundary integral equation 

Dj[^^{u) = h{u), u G L (3-14) 



with 



h{u) 



-h 

+ 



^ I n^2)^xp - Hi.^2))) 

W ^ 

& / ni2)x{0-^GMds^ 

F > j 



(3.15) 



Then ^ vanishes on dL up to small parts of dL. 

The behaviour of [^] near the wing tips (side edges of the wing!) has to be 
analysed in more detail from the analytical as well as from the physical point 
of view. One needs a model which includes the shedding of vortices from the 
tips (side edges!) of the wings. _ 

But we think that the error which results of the non-vanishing of ^ on 
dL near the tips of the wings can be neglected within the scope of this 
model, because the wings are on account of our assumptions narrow near 
these points, i.e. we expect that r(y), as a line integral along cross-sections 
of L, becomes small there. Results by engineers also support that F becomes 
small near the tips of the wings. 

The following lemma holds: 
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Lemma 3.2. 

The boundary integral equation (3.14) ^ is for h £ 

uniquely solvable with the solution ^ 

The idea of proving lemma 3.2 is standard (e.g. [19]). The detailed proof, 
adapted to our situation, can be found in [9], [11]. Using that the operator 
“normal derivative of the double layer potential” is a pseudodifferentialoper- 
ator with symbol 

= (3.16) 

(e.g. [6]), i.e. 

Dl^ = - fc2 . (^ e (3.17) 

where pj^ denotes the operator of restriction onto L and F, respectively 
the Fouriertransformation (branch-cut of the square root equal to R_), 
one shows in our situation of a plane screen by elementary calculations the 
coercivity estimate 

I < >L2(L) I ^ (3.18) 

and that a{(p,f) :=< >/;^2(L) bounded on x 

Using lemma 3.1 one can prove the 



Theorem 3.2. 

Let r denote a function defined on the trailing edge Xf of L with 



with 



HO- 



g € H-1/2(l) 
h, 6 h1/2(s) 



^(^2) -X(6 for ^ e L 

r{^2) for ^ e arf 

r(^2yxp(i^{^l- xt{^2))] for ^ G j 



' no for ^£L 

[#]^(^) = J no) for i£xt 

[ r{0)exp[i-^{0-xt{0))) fo^ J 

and 

m)=m)+no)-xio 

(compare (3.13)) and the solution of (3.14). 

Then, the double layer potential (3.10) with the density 



(3.19) 



(3.20) 

(3.21) 

(3.22) 
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Hu) := (%[^]5) {u) = -j msiO-^G(u,Ods^ (3.23) 

5 ^ 

is a solution of (2.11) with (possible) exception of the Kutta condition. 



Remark 3.2. 

We saw in this paragraph that the main problem is the calculation of the wing 
circulation F in such a way, that the assumptions in theorem 3.2 and the usual 
Kutta condition are fulfilled. The Kutta condition has been postulated to fix 
^ uniquely. This point of view is reflected in our model. 

It turned out that it is a non-trivial problem to define the circulation 
r meaningfully. It is well-known that spaces like or, in our situation 

on account of a weakly damping gas, are canonically solution spaces for 
screen problems for the Helmholtz equation. This fact coincides with physical 



1 — 

arguments: ^ (R’^\*S) means that # and the partial derivatives of ^ of 

order 1 {= velocity components) are square integrable, i.e. these spaces are 
adapted to a finite energy, respectively a locally finite energy. (2.11) shows 
that r is just the restriction But on account of the poor regularity 

of ^ it is not possible to define this restriction by twice applying the (sharp) 
trace theorem. Furthermore, will blow up as Xf is approached. 

We need more regularity near Xf. This additional regularity must be 
gained by the Kutta condition. Hence, we have to specify the classical defi- 
nition of the Kutta condition for our purposes. This idea leads to the 



Definition 3.2. 

We say that the Kutta condition is fulfilled, iff^£ ^(F±), where 

o 

denotes the intersection with a small three-dimensional neighbourhood 

U of Xf and a positive number a. 

Of course, if ^ fulfills the Kutta condition in the sense of definition 3.2 
one can define F by traces and ^ (xf). 

In the next two paragraphs we demonstrate in the half- space situation, 
respectively in the pure two-dimensional case, where we can calculate F an- 
alytically, how the Kutta condition can be fulfilled. 



4 The solution of problem (1.11) in the case of the 
wing projection L equal to the left half-plane 

cy 

In this paragraph we constantly denote by L the “left half-plane” in R^, i.e. 
L := {{x,y) G R^ : x < 0}. Hence, W — {{x,y) £ R^ : x > 0}, 5 = R^ and 
Xfiy) = 0 V 2/ ^ R. 
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For calculating F we can choose the cut-off function x (3.13) in the half- 
space case independent of the second argument. More precisely, we choose % 

as a twice continuously differentiable extension of exp from Rq. to 

R with an appropriate behaviour at infinity, e.g. 



x(C) = x(^i) — I 



6 exp - 32 exp 

+27 exp 



for 

for 



With the abbreviation _ 

m ■■= n^2)xih) 

we can write the boundary integral equation (3.14) as 

D^nu) - g{u) + -£- I rio£-GMds^ 

r2 ^ 



^1 >0 ' 

^ . 

Cl <0 

(4.1) 

(4.2) 

(4.3) 



on L with the additional condition ^ — 0 on Xf. 

Denoting by p_ the operator of restriction onto L and by an arbitrary 
extension of g £ H^{L) to Ig £ H^CR?) with a real number s, which we have 
to concretize more precisely, we can interprete (4.3) as the pseudodifferential 
Wiener-Hopf equation 

p_DW =:p^{lg- DF). (4.4) 



Here again D denotes the pseudodifferential operator ‘‘norma l derivative of 
the double layer potential” , generated by the symbol D(^) = ~ 

If the right hand-side of (4.4) belongs to the space V^(L), we know 

from lemma 3.2 that (4.4) possesses an unique solution ^ e But 

the proof of lemma 3.2 takes no advantage from the half- space geometry. 

Using ideas and modified techniques of Eskin [6], who makes use of the 
half-space geometry, we can give an explicit representation of the solution 
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$ if p_{lg - Dr) £ if-V2 + (^(L) where 0 < (t < ^. If p_(lg - DF) £ 

gpUj ^ additiv into a regular part ^reg{D) and a singular 
part where the representation is explicit and especially explicit 

with respect to the unknown circulation F. The regularity result ^reg ^ 
^ 3/2 + 

It is well-known that solutions of boundary integral equations like (4.3), 
respectively (4.4), behave like ^/r near the boundary dL of the screen L 
where r denotes the distance to dL (e.g. [19]), i.e. behaves like This 

asymptotic behaviour is not compatible with a finite perturbation velocity 
near the trailing edge Xf. Responsible for this poor regularity is the singular 
part Hence, we try to eliminate by using our last existing 

degree of fi:*eedom, i.e. we want to solve the equation 

for the unknown circulation F. This equation can be realized by the condition 




(The details are published in [9], respectively in [11]). 

The right hand-side in (4.6) is of the form a(^2) * ^(^2)* determine 

a(^2) explicitly by calculating at first x(^i) with elementary methods and 
then q:(^ 2) t>y inserting this result into the right hand-side of (4.6) and making 
use of the residuum theorem. As a result we obtain from (4.6) 




Especially, if ^^^^g{F) == 0 we have ^{F) = ^reg{r) € ^(^)- 

If we assume even the still higher regularity g £ and choose 

Ig € ^(R^) we obtain from (4.7) F £ ^({0} x R) and DF £ 

7/1/2 -f g ^Yie assumptions for splitting ^{F) into 

^reg{L) -h ^sing(^) fulfilled. 

Furthermore, the density [^]|5 consisting of the sum of the solution 
and T'(^2 )^(^i) (resulting from the resubstitution according to (3.13)) on L 

and r(^2) ' ^ belongs to ^(R^). This leads to the 




Integral Equations arising from Instationary Flows around Thin Wings 181 



Theorem 4.1. 

Let g £ ^(L) with 0 < cr < ^. Let r be as in (4-^) with an arbitrary 

extension Ig £ ^(R^) of g. Then the double layer potential 



^iu) = -l[$]siO-^G{u,Ods^ 

S ^ 

with the density 

r^i /c^_^ ^(0 + n^2)x{h) for C&L \ 

^ \ r{i2)exp for j 



(4.8) 



(4.9) 



solves problem (2.11). 

Especially q ^ q ^ holds, i.e. the Kutta 

' R*^ 

condition is fulfilled in the sense of definition 3.2. 



5 The solution of problem (1.11) in the purely 
two-dimensional case 

The techniques of proving the representation of in the last para- 

graph are based essentially on the Wiener-Hopf method which displays its 
power especially in the half-space case. If L is a more general screen, one 
needs a newjdea in order to determine F because explicit representation 
formulas for ^ or even are not available. 

But during the last years several authors proved results about the be- 
haviour of solutions of special boundary value problems in two- and three- 
dimensional domains, respectively, near edges and corners of the boundary 
of the domain. We want to use these outcomes. Hence, we transform the 
boundary integral equation (3.14) into a screen problem with Neumann data 
prescribed on the screen L. Then, we interprete the exterior of the screen L 
as a degenerated Lipschitz domain, fix F by the requirement of a vanishing 
singular part of the solution of this screen problem and analyse the regularity 
of ^ constructed with F choosen in the described manner. 

We present this idea here at first in the two-dimensional catse, where we 
also can calculate F explicitly. In the two-dimensional case the wing projec- 
tion L is an interval and W is the interval (x^,oo). Especially F is 

then a complex number and not a function. 

Starting from (2.11) in the two-dimensional case one obtains with the 
methods outlined in 2 correspondingly the hypersingular boundary integral 
equation 



Dj^^{u) = h{u) 



(5.1) 
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with the normal derivative Dj^ of the double layer potential. Because and 
r are scalars now, the right hand-side h is given by 

h{u) = g{u) + rexp I 

W S 

(5.2) 

where G(u,^) denotes now the fundamental solution of the Helmholtz equa- 
tion in (Details can be found in [9] or [11]). One can show that the 
boundary integral equation (5.1) is equivalent to the following screen bound- 
ary value problem: 

Find ip € W^j(R2\L) with 

(A + k^)(f = 0 in R^\L, Re A: > 0, Im /t > 0 

i? = honL 

For fixing F we assume, similar to the last paragraph, a higher regularity 
for h. More precisely, we require h E ^(L) where 0 < cr < -j. Then 

the unique solution cp E W^i^xL) of (5.1) possesses the representation 

ip{r) = ipreg(r) + ipsing(^) 

^singin = Ci(OQv/i^cos^ + ct(r)C^^cos^ 

where we indicated the dependence of the appearing terms on F explicitly. 

Here c^(r) and Cf{F) denote complex numbers independent of the spheri- 
cal coordinates. Q und Ct ^^e smooth, two-dimensional cut-off functions cen- 
tered in (x^,0), respectively (x^,0), whose supports are contained in dis- 
joint domains i?;, respectively f2f. Finally, we denote by (pi,6i), respectively 
(pt,0f), in (x;,0), respectively, (x^,0) centered polar coordinates. 

For the regular part of (p the following result holds: 

Pregin eW^ + ^iB?\L). (5.5) 

X 

W 
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In order to fulfil the Kutta condition we try to fix the circulation F by 
claiming = 0 on i.e. we try to evaluate F by the condition 

cf(r) = 0. 

But, up to now, it is not possible to determine ^reg^ respectively ^g{Yig'> 
i.e. we don’t now the explicit dependence of on F. 

Thus, we try to choose T in a way that that part of ^{F) is filtered out 

which behaves like y^cos^. 

To do this we augment - after a normalization - the singular function 

other functions ^2’ ^3’ " ’ ^ complete orthonormal 

system in L 2 (>T?^). With respect to this function system we expand ^{F) in 
the form 

00 

^ We require then that the scalar factor 

n — \ 

corresponding to in this expansion vanishes, i.e. we demand 

/<^(r)Ctv^cos|dy = 0. (5.6) 

The choice of the L 2 ~product in (5.6) seems to be unnatural but we can 
show that (5.6) is fulfilled iff 

< ^(r),C^v^cos| 0 

holds. 

Using (^(r) = Kj^D'£^h{F) with the double layer potential Kj^ and the 

solution DJ^^h{F) of (5.1) we can solve (5.6) on account of the affine linearity 
of h with respect to F explicitly. The result is formulated in 

Theorem 5.1. 

Let 0 < (j < 2 o,nd g E // + ^(L). Let furthermore be 

F = with 

Z = - J KL{Dl^9{u))CtVPi cos ^dV and 

N = ! K^{Dlhexp[-i-^xt) X 

w s 

S ^^^^'^^^'^’^^ds^))Cty/picos^dV 
L ^ 

where we assume N ^ 0 (Z = 0 is physically unrealistic). 

Then, the double layer potential ^ = Kg[^]g with 
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msio = 



M(a + rxiO 

r 



for L 
for ^ = xt y 
for ^£W 



(5.9) 



solves the two-dimensional version of problem (2.1) where ip denotes the so- 
lution of (5.3). For # € W^j(R2\5) 



^b^±eH2 + <^(/2^±) (5.10) 

holds, i.e. the Kutta condition is fulfilled in the sense of definition 3.2. 



6 A three-dimensional example 

Up to now it is not possible to calculate F in the full three-dimensional case 
analytically, because neither explicit representation formulas for the singular 
part of solutions ^ of the boundary integral equation (3.14) nor for solutions 
if of the corresponding screen boundary value problem like (5.3) are avail- 
able. Only structural explicit representations for have been proven for 

screen boundary value problems with a polygonal screen L (e.g. [16], [17]). 
These representations are much more complicated as in the half-space case, 
respectively the two-dimensional case, because besides edge singularities also 
corner singularities appear. Hence, we can present only a numerical ansatz 
for calculating F within the scope of the least squares method by minimizing 
the stress intensity factor of ip with respect to Xf using the ideas of the last 
paragraph on cross-sections of L. 

We demonstrate this idea here for the case of a screen L with a corner of 
its boundary of opening angle a — ^ dX the trailing edge, but this idea also 
works in the case of arbitrary polygonal screens L. 







Our idea is to transform the boundary integral equation (3.14) again into 
a screen boundary value problem like (5.3) and to describe the singular part 
structural asymptotic representation formulas presented in 
[16], respectively [17]. We recognized that the first edge singularity in the case 
of a polygonal screen is always stronger than the first corner singularity, i.e. 
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it is possible to subordinate the influence of the first corner singularity into 
the first edge singularity, where we describe the influence of the first corner 
singularity by numerical results of Morrison and Lewis [14]. This idea leads 
in the case of a polygonal screen with opening angle a = ^ to the following 

Theorem 6.1. 

Let he h £ and assume the existence of a function F € 

j^3.8146^j^3^ — h on L. Assume furthermore, that X = 2.3146 is 

not a comer exponent. Then, the boundary integral equation (S.14) is equiva- 
lent to the three-dimensional screen boundary value problem (5.3). The unique 
solution ip € W 2 (R^\L) of this problem can be represented near Xf in the 
form 

p{F) = Treg{r) Tsing(^) 

<Psingin = Cm)h\ oinir)^cos^ I (6.1) 

+ C2i02)ht,O^nir)^cos^ ^ 

with preg{r) € ~ ^ (U\L) with an arbitrary e > 0 and a three- 

dimensional neighbourhood U of Xf. Here, Cl?C2 denote smooth cut-off func- 
tions centered in the both arcs of Xf, q the stress intensity factors, pj the 

distance to x^, the distance r to the comer is a variable on the trailing edge 
and are angles resulting from polar- coordinates (compare [16], [17], 

According to the ideas outlined in the last section, we try to fix F such 
that ^ . 

< (KDl^hir)) >L2(Ej r)" ° 

holds for almost all points on x^ with distance r to the corner. Here, 
j = 1,2, denotes the intersection of planes which are orthogonal to the arcs 
of Xf and have the distance r to the corner with domains containing the 
supports of Cl, respectively C2- For calculating F numerically, we propose to 
solve the problem 



(KDihinn,; 

J') ' 



in the L 2 ({r : r G (fi,Ej^^^^)})-noTm with the method of least squares by 
choosing f as a linear combination of -functions. 



7 Concluding remarks 

We note that an acceptable mathematical theory for calculating the pertur- 
bation velocity potential even within the scope of a linear theory, is up 
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to now far away. From the mathematical point of view one needs a more 
detailed representation of the solution of the three-dimensional version 
of (5.3) near Xf in which the stress intensity factors should appear explic- 
itly in dependence of the arc length such that one can formulate and solve a 
corresponding equation for F on Xf analytically. 

New results from Russian scientists let us hope to find a suitable asymp- 
totic representation formula in the case of smoothly bounded wing projections 
L. But for more practically purposes especially wing projections with corners 
are of interest. But in this case, for finding sufficiently explicit representation 
formulas for the solution ip[F) of the screen problem near we think that 
we need a more detailed knowledge of higher eigenvalues and corresponding 
eigenfunctions of the Laplace-Beltrami operator (compare [16], [17]) in order 
to eliminate sufficiently many singular terms in the expansion of ^{F) near 
Xf by choosing F in such a way to increase the regularity of ^ up to an 
order, such that becomes square integrable on Xf. But we expect that 
these eigenvalues are not bounded far away from each other, such that besides 
analytical problems also numerical ones will appear. 

In this manuscript we presented the philosophy to fix the perturbation 
velocity potential by conditions on the one-dimensional manifold In the 
case of a trailing edge with corners the calculation of F will depend essentially 
on the opening angles of the corners such that ^ has to be evaluated about 
four dimensions. At this point the question appears wether a more practical 
relevant model should be formulated in form of a mesh problem holding in 
four dimensions. 

Another problem appears at the tips of the wings. Within the scope of 
the model presented in 2 it is not possible to allow edges for the points of 
maximal ^/-extension on account of further problems of modeling, because 
vortices would leave the wings there. Practical experiences show that F has 
to vanish near the tips of the wings. The circulation obtained by our model 
will not vanish there in general. 

Hence, one has to treat three different kinds of points on the trailing edge 

separately: Points where x^ is smooth, corner points of x^, and boundary 
points of Xf. Here we found analytical solutions only for points where Xf is 
smooth and this happens only in the special situation that Xf consists only 
of one point, respectively Xf is the whole line x^ = {(x,y) G : x = 0}. 

Furthermore, we assumed that the wake W is a semi-infinite strip with 
constant width. From the practical point of view we cannot support this 
assumption. W will grow in the ^/-direction with increasing x as a simple 
observation of an aeroplane or a ship shows, and the wake will not be of 
infinite length. Hence, one has to determine W in a more detailed, time 
depending model by solving a free boundary value problem. 

Nevertheless, we want to present our results because, for the first time, 
we can touch the main problems of lifting surface theory which will be, as 
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we think, also of interest for more directly, pratically developments. The 
calculation of F is not trivial and requires further investigations! 

The problems outlined have to be solved in near future, because only 
after a solution of these problems essential progress in lifting surface theory 
could be expected. Also other topics which lead to similar problems, e.g. 
in elastodynamic fracture theory, can be solved only after a solution of the 
problems outlined. 

Besides the questions quoted, we want to list some further problems in 
connection with lifting surface theory for which one needs a complete solution 
of (2.11): 

a. The model has to be adapted to more practical purposes. The assumption 
of a thin, weakly cambered wing is not senseful from the practical point of 
view. Also regions where the flow separates from the wings have to be con- 
sidered. 

b. The assumption of a weakly damping gas is not reliable. One has to take 
into account real wave numbers, e.g. by introduction of weighted spaces or by 
using the limiting absorption principle. Furthermore, it is desirable to con- 
sider the wave number fc as a parameter and to study F and ^ in dependence 
ofk. 

c. wings moving near the ground 

d. correction formulas for a lifting surface in a wind-tunnel 

e. arbitrary time dependence of the motion 

f. proving of the convergence of numerical approximation schemes in flnite 
dimensional spaces which take advantage of the trailing edge situation 

g. programming of a theoretically assured algorithm on the base of graded 
meshes which uses modern techniques for evaluating singular and hypersin- 
gular integrals. 
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Multiscale Methods for the Solution of the 
Helmholtz and Laplace Equations 

W. Dahmen, B. Kleemann, S. Prossdorf, and R. Schneider 



1 Introduction 

This paper presents some numerical results about applications of multiscale 
techniques to boundary integral equations. The numerical schemes developed 
here are to some extent based on the results of the papers [6]-[10]. Section 2 
deals with a short description of the theory of generalized Petrov-Galerkin 
methods for elliptic periodic pseudodifferential equations in covering clas- 
sical Galerkin schemes, collocation, and other methods. A general setting of 
multiresolution analysis generated by periodized scaling functions as well as 
a general stability and convergence theory for such a framework is outlined. 
The key to the stability analysis is a local principle due to one of the authors. 
Its applicability relies here on a sufficiently general version of a so-called dis- 
crete commutator property of wavelet bases (see [6]). These results establish 
important prerequisites for developing and analysing methods for the fast 
solution of the resulting linear systems (Section 2.4). The crucial fact which 
is exploited by these methods is that the stiffness matrices relative to an ap- 
propriate wavelet basis can be approximated well by a sparse matrix while 
the solution to the perturbed problem still exhibits the same asymptotic ac- 
curacy as the solution to the full discrete problem. It can be shown (see [7]) 
that the amount of the overall computational work which is needed to realize 
a required accuracy is of the order 0(iV(logiV)^), where N is the number of 
unknowns and 6 > 0 is some real number. 

We focus here on two problems which are solved by fully discrete colloca- 
tion wavelet methods. Section 3 is devoted to various numerical experiments 
for the exterior Dirichlet problem for the Helmholtz equation. In this case the 
theoretical results for periodic problems apply and are confirmed by the nu- 
merical tests. In Section 4 we present some new results concerning a Dirichlet 
problem for the Laplace equation over three dimensional polyhedral domains. 
Linear systems with « 100,000 unknowns are solved which corresponds to 
fully populated matrices of the same order. In this case our goal is not yet 
to present a fully developed scheme for such a complex three dimensional 
problem. In fact, one should note first that the general theory described in 
Section 2 is not directly applicable to this problem because it is not periodic 
and the boundary is not smooth. Therefore our main goal here is to explore 
to which extent the theoretical predictions from the model problem can still 
be confirmed under these less ideal circumstances. In particular, we focus on 
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compression properties and the convergence behavior of the corresponding 
solutions of the perturbed problem. Therefore we content ourselves here still 
with a rather expensive way of computing the compressed stiflFness matrices. 
Speeding up this portion of the solution process is less dependent on the 
topology of the domain and is meanwhile understood conceptually. Essen- 
tially following the analysis in [10], the practical realizations and numerical 
tests presented here do confirm a similar behavior as predicted by the theory 
for the periodic model problem. 



2 Multiscale Methods 

2.1 Periodic Pseudo differential Equations 

At this stage we focus on the model case of periodic pseudodifferential equa- 
tions to exploit the full advantages of Fourier transform techniques in connec- 
tion with appropriate representations for the class of operators under consid- 
eration. However, we do consider variable symbols and it should be mentioned 
that this class covers all the classical examples such as Hormander’s class, 
in particular, those operators arising in connection with boundary element 
methods. Moreover, most of the methods used here are of local nature and 
thus apply in essence to the case of non-periodic equations, as well (see [8]). 
Consider the discrete Fourier transform by u{k) := u(x)e~^^^®‘^da:, 

k G where x ' k — denote the n-dimensional torus. 

Then a periodic pseudodifferential operator is defined by 

a{x,D)u{x) := ^ cr(x, . 

The function a G C^{T^ x Z"), which is called the symbol of the operator 
cr(x, D), is assumed to belong to a certain clctss for some /i G C. Here 
is comprised of all symbols a of the form cr = (Jq H- cri , where ao G C^{T'^ x 
(E^/{0})) is homogeneous of degree G C, i.e. cro(x, 0) = 1, ao{x,Xk) = 
X^ao{x,k), for A > 0, A: / 0, and \d^y.)T^k)^i{x,k)\ < Ca,/?(1 4- for 

X G T^, k G Z^, and for some r\ < Re /x = r. Here d stands for the partial 
differential operator and r for the partial forward difference operator. By 
we denote the class of operators of the form A — cr{‘,D) -f- K where 
a G r = Re /X is the order of A, and (Ku){x) = k{x,y)u{y)dy with 
k G C^{T'^ X M^) is a smoothing operator. The operator cr{x,D) G A^ is 
called elliptic if, for sufficiently large |A:|, \cro{x^k)\ > c\k\^, x G T^. 

Any A G is a bounded linear operator A : t G M. 

This operator is Fredholm if and only if it is elliptic. Here denotes the 
classical periodic Sobolev space of order t, equipped with the norm HuHt := 
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2.2 Multiresolution and Wavelets 

Multiresolution or Multiscale Analysis (MSA) is by now a well-studied notion 
[16]. Here we focus only on those variants which are useful for our purpose. 
MSA of L^(M) is a sequence of nested closed subspaces ■ ■ • C V_i C Vq C 
Vi C • • • C (M) with 

1. U/P} = LHR), 

2 . r\jVj = { 0 }, 

3. fix) £ Vj ^ f(2x) e 

4. There is a function (p e L^(M) such that the translates {(p{x -k)} (k £ Z) 

form an orthonormal basis of Vq. 

The function ip is called scaling function. Obviously, the functions Pj,k{x) 
:= 2^/‘^p{2^x - k) form an orthonormal basis of Vj (j £ Z) (see [11] for 
examples). Since p £ Vq C V\ there exists a sequence (which is 

called the mask or the filter of p) such that p satisfies the scaling equation 

p{x) = ^/2 ^ hkp{2x — k) , a; e E. (2.1) 

k&TZ 

Moreover, the mask {/i*,} is finite if and only if p is compactly supported. 
(2.1) is the key to the constructions of orthogonal wavelet bases and of fast 
wavelet algorithms. The wavelet space Wj (j e Z) is defined as the orthogonal 
complement of Vj with respect to Vj+i = V} 0 Wj. Thus 

L‘^{R) =®j^ 2 Z^ 3 —^m®®j>mWj (for each m G Z). (2.2) 

One of the main results of MSA reads as follows. Given p, then there 
exists xp £ L^(E) such that the functions V’j.jfe(a^) := 2^^‘^xjj{2^x — k), k £ Z, 
form an orthonormal basis of Wj (j £Z). The function tp is called the mother 
wavelet of MSA and is defined by 

ip{x) = ^ 3*<^(2x - k), gk = (-l)*/ii_/fe. (2.3) 

An important property of wavelets is that certain moments vanish, i.e., 

j x^ip{x)dx = 0, 0 < I <d*, (2.4) 

R 

where in the case of orthogonal wavelets d* is the order of polynomials which 
can be written as linear combinations of the translates p{- - k),k £ Z. 

More flexibility is offered by the concept of biorthogonal wavelets [11] 
which permits the employment of H-splines as scaling functions. In particu- 
lar, the possibility of raising the order of moment conditions turns out to be 
essential for balancing compression and convergence rates. 




192 W. Dahmen, B. Kleemann, S. Prossdorf, R. Schneider 



In the following we need a periodic version of MSA. To this end, let [•] 
denote the periodization operator defined by [f]{x) Z1a;62z/(^ 

any compactly supported function /. Given a compactly supported MSA 
(p € then the functions [(pm,k] 2 tnd [^pm,k] are 1-periodic. MSA of 

(E^ ) can be defined in a completely analogous manner. However, one needs 
a finite number of mother wavelets depending on the type of scaling. 

2.3 Generalized Petrov— Galerkin Schemes 

The spaces Vj will be used as trial spaces for the approximate solution of the 
equation 



Au = f, (2.5) 

where A £ and / € is given. In the following we will fix one such 
t and assume that r] G JT^“^(E^) is a fixed linear functional with support 
in some compact set F C W. Defining the functionals rjj^k by rjj^kif) 
2-nj/2^(y(2-i(. + k))), k e we seek for an element uj G Vj satisfying 

vjAM) = vjMf ) . e == (2.6) 

Clearly, rj = (f corresponds to a classical Galerkin scheme, while rj = 6{--xo) 
give rises to collocation at the points 2~^{k + xo), fc G j G Nq. 

Our first objective is to study the solvability of (2.6), and if this is the case, 
the convergence of the solution Uj as j oo. The key to this problem is a 
suitable stability concept. To this end, it is convenient to consider projectors 
of the form 

Qjf “ 'nj,k{f)Cj,kj 

where the are suitable basis functions satisfying rjj^kiCj.i) = ^i,k (possibly 
spanning spaces different ft*om Vj, see [6] Sect. 4). Then (2.6) can be rewritten 
as operator equation QjAuj = Qjf. The scheme (2.6) is called {t,r)-stable if 

\\QjAv\\t-r > cl|i;||t (2.7) 

for all V £ Vj uniformly in j G Nq. Clearly, (2.7) means that the finite 
dimensional operators Aj := QjAPj : H* have uniformly bounded 

inverses Aj^ : im Qj im Pj. 

It turns out that the stability of (2.6) is equivalent to the ellipticity of a 
certain function A which shares many features with the principal symbol (Jq 
of the operator A and which is called the numerical symbol (more precisely, 
the “symbol of the Petrov-Galerkin scheme (2.6)”). The numerical symbol is 
defined by 

\{x,y) := <To(a;,j/ + fc)^(j/ + fc)^(y + A:) 

keTZ" 



( 2 . 8 ) 
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for all x,y e (provided that the series on the right hand side of (2.8) 
converges absolutely for all x,y e T^), where " stands for the Fourier 
transform. 

Theorem 2.1 ([6]). The scheme (2.6) is {t,r)-stable if and only if the nu- 
merical symbol X is elliptic, i.e. |A(x,2/)| > c\6{y)Y holds uniformly inx £ T'^, 
where 9 = (9i, ...,9n)'^ , 9j{y) = - 1. 

Example 2.1. Consider the knot collocation (i.e. xo = 0) with tensor product 
splines of degree d. Then ^ = 1 and (p{^) = (^6)^*^^- 

Clearly, if d is odd, then A is elliptic if and only if A is strongly elliptic, i.e. 
Re ao > const > 0. Hence, in that case. Theorem 1 provides stability of the 
collocation with odd degree splines for strongly elliptic operators (cf. [1] for 
the case n = 1 and [6] for the multidimensional case). Notice that, in the 
case of the classical Galerkin scheme, the ellipticity of the numerical symbol 
for strongly elliptic operators is a consequence of the stability property of cp. 

Applying Theorem 1 in combination with well known Galerkin techniques 
and approximation properties of the functions [(fj,k\ one obtains optimal es- 
timates for the error ||u — for a certain range of Sobolev norms (see 
[ 6 ]). 

2.4 Matrix Compression and Fast Solution 

This technique is essentially based on estimates for the asymptotic behavior 
of the entries of Aj {j oo) in the wavelet representation. We will present 
such estimates only for the case of classical Petrov-Galerkin schemes, i.e., 
when T] is actually an L^-function. 

Theorem 2.2 ([7, 17]). Let 2d* -h n -h r > 0 where d* denotes the number 
of vanishing moments of the wavelets. Then for A £ the estimate 

I *],[#,*']) I < gn+r+2d-+2 (2.9) 

holds, where g = dist (supp supp [^i\k']) o.’^d the constant c depends 

only on r, n and d* . 

(2.9) are the crucial estimates which compression criteria rely on. Such 
criteria allow us to avoid the computation of the full stiffness matrix in the 
wavelet representation and tell us which entries must be computed in order 
to guarantee a required accuracy. Moreover, by realizing sufficiently high 
accuracy on lower scales, the asymptotical convergence rates of the solutions 
to the uncompressed systems can be preserved for those of the compressed 
system. Such a compression strategy reduces the computational work to the 
order 0{N{logNy), where N = 2-^^ and 6 is a positive number ([7]). 
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3 Exterior Dirichlet problem for the Helmholtz 
equation in 2D 

3.1 Statement of the problem 

The treatment of scattering of time-harmonic acoustic and electromagnetic 
waves on infinitely long cylindrical obstacles in with simply connected 
cross section D- C and smooth boundary F leads to an exterior boundary 
value problem for the Helmholtz equation in . 

D Given the boundary data g and 

r the wave number k > 0 in D, real. 



yjin 



We seek the solution w of the following problem 

Aw{x) k^w{x) = 0 in D = E^\D_, (3.1) 

w{x) — g{x) on T, (3.2) 

- - ikw{x) = o(r“^/^) , r = |x| -4 00 (3.3) 

or 

uniformly in all directions {SommerfelFs radiation condition). This problem 
is known to have a unique solution (cf. [5]). 

Using a single-layer approach (indirect method), one seeks w in the form 

w(x) - 4>{x, y)i{y)ds{y) , x & D, (3.4) 

where s is the arc length parametrization, and the fundamental solution cor- 
responding to the Helmholtz equation is given by 

<l>ix,y) ■= j^H^^\k\x-y\),x^y. 

Here means the Hankel function of order zero and of the first kind. 
Substituting (3.4) into (3.2) gives the boundary integral equation for the 
unknown density 

<A(a:, y)^(.y)ds(y) = g(x) , x € T . (3.5) 

Equation (3.5) is uniquely solvable in H^{F) provided that the homogeneous 
Dirichlet problem for the interior of F admits only the trivial solution. 
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Let x{t) = (xi {t),X 2 {t )) , 0 < t < 27 t , be a 27r-periodic regular parametri- 
zation of F satisfying [x[{t)f + [x 2 (t)]^ > 0, t £ F. Setting in (3.5) u{t) := 

{[x[{t)]^ + [x 2 {t)^^y^‘^ and f{t) := g{x{t)) , the exterior Dirichlet 
problem for the Helmholtz equation leads to the following logarithmic singular 
integral equation of the first kind: 

{Au){t) := f K{t,T)u{r)dr - f{t) , 0 < t < 27 t, (3.6) 

Jo 

K{t,T) := jH^^\kr{t,T)) 

where r(i,r) := {[xi(i) - a;i(r)]''^ + [x 2 {t) - X 2 {T)fY/'^. 

In principle it is possible to separate the logarithmic part of the Hankel 
function and treat it separately. There exist several fast methods for the 
solution of the logarithmic kernel equation (also called Symm’s equation). In 
particular, one can use the fast Fourier transform, apply the fast method [20] 
or the exponentially convergent method [14]. Here we do not separate the 
logarithmic part because we only consider it as a model problem for other 
cases where a separation is usually not applicable. A second related reason 
is to simplify the implementation by keeping it as independent as possible 
of further analytic investigations which have to be tuned to the particular 
application at hand. 

3.2 Wavelet Discretization Method 

For the numerical solution of (3.6) consider the knot collocation method on 
the following nested grids of [0, 27 t]: 

V' = := khi, k = 0,...,Ni -l,hi = 2-' ■ 27T, Ni = 2'}, I = 

The set of additional knots for passing from level 1-1 to level I is denoted by 
. For a corresponding Galerkin wavelet method the reader 
is referred to [17]. As trial functions we use here continuous piecewise linear 
functions spanned by the canonical periodized hat functions [(fj,k]: i-^-, 

{ 1 + X if — 1 < X < 0, 

1 — X if 0<a:<l, 

0 else. 

In this case the mask in the two-scale relation (2.1) is 

= (3.7) 

Since the translates (/?(• — k) are not pairwise orthogonal the simple recipe 
from (2.3) for forming orthogonal wavelets does no longer work. Nevertheless, 
one can resort to the literature for a long list of candidates 'ip{x) whose 
integer translates are stable and span the orthogonal or other complements 
of Vo in V\. In particular, for any desired order (F of vanishing moments 
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(f(x) = ^(p{2x H- 1) + (p{2x) H- ^(p{2x - 1) 





Fig. 3.1. Two-scale relations for our choice of generating function and mother 

wavelet. 



biorthogonal wavelets have been constructed in [4] which all give rise to Riesz 
bases. Specifically, the wavelet for d* =2 from this family hsis support [—1, 2] 
and the mask hats the form 

{9k}k=-i - ^8’ 4’ 8^* 

To keep the support of xjj as small as possible we will focus here though on 
the following choice with also vanishing moments of order d* —2\ 

= (3.9) 

which is depicted in Figure 3.1. These wavelets are likewise biorthogonal in 
the sense of [4] and therefore give rise to a Riesz basis ats is shown in [18]. 
Since we are employing a collocation method the Brandt/ Luhrecht functionals 
[2] are adequate “test wavelets” (because of pointwise evaluation) spanned 
by (^-distributions: 

= /(4t+i) - 5[M) + M+i)]- 

A: = 0, 1; %,*(/) = = 0, 1- 



(3.10) 
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One easily verifies that the rji^k have also vanishing moments of order d* =2. 
Figure 3.2 illustrates for three levels I how the point values are transformed 
into the data f]i^k{f)- Obviously, the complexity of this transformation is 
of the order 0{Nj) when j is as above the finest level. Given (3.10), the 

Mill), 

• : 4'-' 

/(4+i), V' 

I 

• : A‘-^ 

/(4;i). V'-' 

I 

m-'i.kif) 

Fig. 3.2. Generation of the functionals of Brandt/Lubrecht. Open circles indicate 
those grid points whose values are to be calculated in the next step. 






collocation stiflFness matrix Aj on level j relative to the wavelet basis has 
entries of the form 



■— Vl^k'i^bPhk]), ( 3 - 11 ) 

fc = 0,...,2'-i-l, fc' = - 1, 1,1' <j. 

It is easily seen that the integral operator A defined by (3.6) belongs to 
the class (see Sect. 2) with r = —1 and r\ = —3. Moreover, it is strongly 
elliptic and, hence, the considered collocation method is stable (see Theorem 
2). Furthermore, the error estimate 

\\u-Uj\\s < c2^^^~‘^^\\u\\2 (3.12) 

holds for all s, — 1 < s < 3/2, provided / G H^. 

Using an a-priori compression criterion established in [7] and improved 
in [21] we only calculate and store approximations to those matrix entries of 
(3.11) obtained through quadrature for which 

< max{a2-',a2-*',aj5/« • 2(V3W-(4/3)J-(2/3 )j' J (3 ^3) 

where Qi^k is the support of [^i,k], ^v,k' the support of r]i-^k’ , and a a constant 
that has to be chosen appropriately. Note that, according to the general 
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theory, to realize an (asymptotically) almost optimal compression rate for the 
choice of piecewise linear wavelets with two vanishing moments it would be 
necessary to use Brandt /Lubrecht functionals with three vanishing moments. 
In this case also criterion (3.13) changes somewhat. Nevertheless, the present 
method turns out to be more efficient for the present scope of experiments 
because of the smaller support of the test functionals. Also our convergence 
studies confirm a sufficiently good nearly optimal performance of the present 
version in all our test problems (see Figs. 3.3,3.4,3.5). 




JV, 



Fig. 3.3. Error Ej in logarithmic scale for various wave numbers k and constants 
a for the a-priori compression strategy. F = circle. 



In order to get a fully discrete method we have to employ a suitable 
quadrature method to approximate the integrals. Because of the logarithmic 
singularity of the kernel function it is necessary to use an adapted rule for 
the quadrature to guarantee that the order of convergence is better than one. 
Therefore we choose the following graded quadrature points Tq of the interval 
[0,27t] with grading parameter a > 1. More precisely, the points are graded 
near the singular point t E (0, 27 t) as follows: 



Tq=7T 



fh 



sign q-\-t, q= -rh, ...,m - 1. 
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Fig. 3.4. Error ej in logarithmic scale for various wave numbers k and constants 
a for the a-priori compression strategy. F = circle. 

The integer m has to be chosen proportional to Nj . To make the quadra- 
ture exact for the ansatz functions these graded points are united with 
equidistant ones: ^ = {t{}. Let Tq,q = 0, . . . ,m— 1, m — 2fh-\-Nj be 

the quadrature points obtained by periodizing {r^} U {r®}. In our numerical 
examples we choose a grading parameter a == 2.5 together with the trape- 
zoidal rule. The latter is exact for linear polynomials, and hence sufficient to 
obtain second order convergence in (cf. (3.12)). 

3.3 Assembling the compressed stiffness matrix 

Let 

m— 1 

^ K{ti,Tg)ujgi)i^kiTg), k = 0,...,2^ - I, I <j, 

q = 0 

Tq 

be the approximation of obtained by quadrature. Here Uq are 

quadrature weights. 
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10 

1 



QA 
0,01 
0.001 
0.0001 
ie - 05 



le - 06 



le - 07 

10 100 1000 10000 

Fig. 3.5. Errors Ej , ej in logarithmic scale for an elliptical cross section and fixed 
wave number = 10 for two different constants a for the a-priori compression 

strategy. 



Keeping in mind that 

and that the vanishing moments of the test wavelets r]i^k imply that these 
quantities decay, their approximations obtained through quadrature are ex- 
pected to exhibit the same behavior. Thus we have to compute the following 
expressions: 

m— 1 

q=0 

.. m— 1 

<j=0 




(3.14) 
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^ m— 1 
«=o 

i = 0,. . . ,Ni' - I, k = 0, . . . ,Ni - I, 1,1' = I,. . . ,j - I, 



m— 1 






g=0 

m— 1 



-j X 

9 Z) ^(*f.'T«K[¥’l,fc](T-,) - 



q=0 

m— 1 



1 Kit^i+l,Tg)Wg[tpi^k]{Tq), 



q = 0 



+1 



Z-0,...,7V^. -1, A: = 0,1, 



(3.15) 



m— 1 

mdM^i,k])= (3.16) 

q = 0 

T„qtt] 

i = 0, 1, k-0,...,Ni-l, l-l,...,j - I, 

m—1 

mdMfhk]) = X] (3-17) 

<j=0 

Tg^t] 

i = 0, 1, A: = 0, 1. 

Here it is important to note that, according to our above remarks, these 
calculations are only performed for those pairs (/', A:'), (/, A:) which satisfy 
condition (3.13). Thus storage is only required, and hence is directly reduced, 
for the significant entries of the matrix Aj defined by 






( 0 if (3.13) does not hold, 

\ m',k'{Aj[xlJi,k]) if (3.13) holds, 



where A:' = 0, . . . — 1, A: = 0, . . . , A^/ — 1, i, T = 0, . . . , j — 1 and where we 

have set i/jo.k '= 

Next let us denote by Bj the transformation (3.10) which takes the array 
of data k = 0, . . . ,Nj — I, into the array rji^kif)- One can check that 
Aj is the compressed version of the matrix BjAj. Hence we end up with the 
following sparse matrix equation 
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= Bjfj, (3.18) 

where fj := (/(<o), /W). • • • 

Note that it is possible to use the same quadrature formula as for a stan- 
dard collocation method not based on wavelets for all expressions in (3.14)- 
(3.17). It is only necessary to shift the grading for the singularity to the actual 
singular point. 

Clearly, the solution uj of (3.18) consists of the coefficients of the approx- 
imate solution relative to the wavelet basis. If one wishes to represent the 
solution in terms of the nodal basis, a further transformation 






has to be performed. This transformation also has the following familiar 



pyramid structure 








wi,. 


W2,. 




W3,- 


Wj-1, 




\ Cl 


\ 


C2 


\ 


Wi,. 


W2,. 


-> 


U3,. ■ 


■■ -> 




Ml 


M2 




Mj-1 



C'.-i 



(3.19) 



where the matrices Mj, Cj, i = 1, . . . , j — 1 are sparse and contain the 
masks from (3.7) and (3.9). For a description of the algorithm in pseudocode 
see [10]. Thus Tj takes coefficients uj := wi^k} of the wavelet representa- 

tion Efc=o,i + Efce/i' into the coefficients uJ := {uj,*} 

of the corresponding nodal representation Also the applica- 

tion of the transformation Tj is easily seen to require only 0{Nj) operations. 



3.4 Preconditioning and solution 

The linear system (3.18) is solved with the aid of the iterative Krylov sub- 
space method GMRES (cf. [19], [24]). This method has been proved to be 
efficient for this kind of systems (cf. e.g. [23], [13], [10]). Since r = —1, the con- 
dition numbers /c(Ap grow like Nj so that the number of iterations which is 
necessary to accomplish a desired accuracy increases when Nj becomes large. 
The theory in [21] suggests multiplying A^j with the Nj x Nj diagonal matrix 

D from the right where = (k‘^ + 2^^) = • <5(1, fc), After solving 

then the system A^jDvj = Bjfj one has to transform the resulting vector 
according to uj = Dvj to obtain the wavelet coefficients of the solution. 

We have also tested the following alternative preconditioning strategies 
based on diagonal scaling. Let D^i^k),{V ,k') = 2^ * ^{i,k),(i',k')- 

1. Solve DA^jUj = DBjfj or 

2. solve A^D^Vj = D^Bjfj followed by corresponding diagonal back 

transformations . 
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The results of the tests for a fixed example are given in Table 3.1. The best 
preconditioner is the one suggested by the theory which gives a constant iter- 
ation count. May be that with higher order Brandt /Lubrecht functionals the 
iteration count can further be reduced. The corresponding tools are available 
and, in principle, easily implemented. 



Table 3.1. Number of GMRBS iterations without preconditioning and with 
different preconditioners for the following example: F= ellipse, fc = 10, a = 1.0. 



Preconditioning | 


3 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


Nj 


4 


8 


16 


32 


64 


128 


256 


512 


1024 


2048 


4096 


without 


3 


5 


9 


14 


18 


21 


29 


33 


37 


40 


44 




3 


5 


9 


15 


18 


20 


25 


27 


28 


28 


29 




3 


5 


9 


15 


17 


19 


24 


27 


26 


26 


27 




3 


5 


9 


14 


15 


16 


21 


23 


23 


22 


22 



The main steps of the solution procedure may be summarized now as follows: 

1. Compute the matrix Aj directly in the wavelet representation as described 

by (3.14) - (3.17) and apply the a-priori compression criterion (3.13) 
yielding a sparse matrix; 

2. apply the transformation Bj (3.10) to the right-hand side: 

^ {%A;(/)}, k = Bjfj, which requires only 

0{Nj) operations; 

3. solve the sparse matrix equation A^jDvj = Bjfj by an iterative solver (e.g. 

GMRES). The complexity of this part is nearly 0{Nj {log Nj)^)] 

4. back transform the solution of this linear system to uj = Dvj {= Nj op- 

erations), which provides the coefficient vector uj relative to the wavelet 
basis and 

5. (optionally) apply the transformation Tj which yields the coefficients u'j = 

TjUj for the nodal basis representation. Again this only requires 0{Nj) 
operations. 

Clearly, the multiplications with the diagonal matrices are applied to the 
iteration vector rather than to the matrix A^ Aj{Du^^^), with the actual 
iteration vector of uj during the GMRES-iteration. This only requires 2Nj 
additional operations for the diagonal matrix times vector products in each 
iteration step. 

Summing up the operation count in steps 2.-5. the overall complexity is of 
order 0{Nj{logNj)^) which gives rise to a fast solution of the linear system. 
Unfortunately, step 1., the matrix assemblation, is still of the order 0{Nj) 
although only (nearly) 0{Nj{logNjY) matrix entries are calculated. This is 
due to the poor quadrature scheme which does not take the distance between 
the supports into account. Meanwhile it has been shown in 

[21] that such a refined quadrature strategy reduces the computational work 
for assembling the matrix to the order of nonvanishing matrix entries. 





204 W. Dahmen, B. Kleemann, S. Prossdorf, R. Schneider 



After the evaluation of the solution, the relative error Ej of the solution 
and the convergence rate Pi are calculated: 

p _ _ \og{Ej/Ej-i) 

3.5 Numerical examples for scattering 

We consider the scattering of an E-polarized electromagnetic plane wave 
by a perfectly conducting cylinder. In acoustics this problem is equivalent to 
the scattering of a plane wave by an impenetrable, sound-soft cylinder with 
smooth cross section. Let the incident wave be given by v}^{x) = exp(ifceinx), 
with 6in a unit vector describing the direction of propagation of the incident 
wave, and let u® be the scattered wave. Then the exterior wave (total field) 
-f n® is also a solution of the Helmholtz equation Au^ -f = 
0 in D and satisfies the boundary condition u® = 0 on F. So we have u® = 
on r and t/® also satisfies the conditions (3.1) and (3.3). Thus in 
terms of our initial unknowns we have: 

• w := n®, 

• g iz®|r = — exp(ifc^n^*^)|r and if we set for the incident direction 

Cin = (1, 0), we obtain 

f{x{t)) := -{cos(kxi{t)) +isin(kxi{t))). 

With these data one can then solve the integral equation (3.6). 

The solution it®, the backscattered electric field, exhibits the following 
asymptotic behavior which follows from the asymptotics for the Hankel func- 
tion for large argument: 

U°°ix) = ^^(y)exp(-ifcxy)ds(j/). 

uniformly for all directions x := x/\x\. The function is known as far field 
pattern or scattering amplitude of u®. Substituting for ^(y) the solution u{t) 
of our integral equation, we obtain 

u^{x) = ^ [ u{t)exp{-ikeohx{t))dt. (3.20) 

vSttA: Jr 

A more interesting quantity is the radar cross section (RCS). It is defined 
by (7^(6 ) — 27rlim|a.|_4ooX-jyp, with 0° < < 360°, the backscatter angle. 

With the aid of the asymptotics of u® we obtain the following approximation 
— 27t\u^{x)\^ for the RCS where x is the direction given by 6 . When 
plotting the RCS as a function of the observation angle, the quantity 
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(7 = 101ogio(tr^/A), A= wavelength (3.21) 



are shown in the figures rather than cr^ which is common in the engineering 
literature. 

After the solution of the integral equation by the wavelet method, the 
integral (3.20) is calculated which is a functional of the solution. This is re- 
alized by the rectangular rule which in the periodic case is equivalent to the 
trapezoidal rule. Let the approximation of (3.20) be u’^. For the determi- 
nation of and we fix the direction of observation. For a sequence of 
discretizations we determine the error ej and the convergence rate P 2 for the 
functionals 






02 = - 



log(ej/ej-i) 

log 2 



For all numerical examples we have selected a = 2.5 as an optimal grading 
parameter for the quadrature points and employing the a-priori compres- 
sion criterion, we have examined the convergence and compression behavior 
for various wave numbers k and constants a. In the first example we consider 
a disc of radius r = 0.6 as cross section of a scattering obstacle (cf. e.g. [3]) 
and we set Cob (1,0) while the second considered obstacle is assumed to 
have an elliptical cross section with = 2by = 0.6. 

Figures 3.3 and 3.4 present the behavior of the errors Ej^Cj for the disc. 
Figure 3.5 shows results for the case of an ellipse. Figures 3. 3,3.4, 3.5 clearly 
show the predicted behavior for the convergence rate of Ej , tj for the inves- 
tigated boundaries if the constant a for the a-priori compression criterion 
is appropriately chosen. The convergence rates of « 2 coincide with the ex- 
pected convergence rate of the solutions to the full systems when using the 
same discretization method. Because of (3.12) for ej a convergence rate of 3 
is possible when using a quadrature of higher exactness. 

The constant a from (3.13) in practical applications should be chosen so 
that the supports of '^j^k^'Hj.k' do not overlap on the finest level j. This leads 
to a fixed bandwidth for the diagonal bandmatrix on the finest levels j and 
determines all other bandwidths of the appearing block band matrices. For 
our special choice of 'ipi^k:Vi,k we recommend a bandwidth 6 of 5 > 4. For 
greater values of k or more complex domains D_ it may be necessary to 
increase this value by small amounts. 

Obviously, the error ej for the functional of the solution is not as sensitive 
to different compression constants a as the error Ej. This is due to the fact 
that a quadrature rule of higher exactness would yield a better compression 
rate for this error. 

Figures 3.6 and 3.7 show the number nze of nonzero elements of the com- 
pressed matrix Aj for some constants a and the time tcu for the iterative 
solution of the linear system by a preconditioned GMRES. One clearly ob- 
serves that the time depends on the wave number k as well as on the constant 
a for the compression criterion. 
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Fig. 3.6. Number nze of nonzero elements of the compressed matrix for two 
constants a for different wave numbers k. 




0 1000 2000 3000 4000 



Fig. 3.7. CPU- time tcM in seconds on a DEC 3000 AXP 500 of-processor 
workstation for the iterative solution of the arising linear system by GMRES for 
different wave numbers k and constants a. 
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Finally, Figs. 3.8,3.9,3.10 present some results for the RCS for two do- 
mains and two different wave numbers k. The results agree with those ob- 
tained with a conventional collocation method. 




Fig. 3.8. Radar cross section (RCS) a versus viewing angle for the scattering of a 
frequency of 2.4 GHz implying a wavelength of A = 0.125 m at a disc with radius 
r = 0.25 m. The quantity k • 2r determining the Hankel function values is equal to 

25.13. 




Fig. 3.9. Radar cross section (RCS) cr versus viewing angle for the scattering of a 
frequency of 10 GHz implying a wavelength of A = 30 mm at an ellipse with ax = 
60 mm, by = 30 mm. The quantity k • 2a determining the Hankel function values 

is equal to 25.13. 
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Fig. 3.10. Radax cross section (RCS) a versus viewing angle for the scattering of 
a frequency of 10 GHz implying a wavelei^th of A = 30 mm at an ellipse with a® 
= 300 mm, hy = 150 mm. The quantity k • 2a determining the Hankel function 

values is equal to 125.66. 



4 Dirichlet problem for the Laplace equation in 3D 

In this section new results for the multiscale method proposed in [10] are pre- 
sented for a large number of degrees of freedom Nj. Note that linear systems 
with nearly 100,000 unknowns are solved. Since the thresholding procedure 
is chosen so that the optimal asymptotic accuracy of the discretization error 
is preserved this corresponds to solving fully populated linear systems of the 
same size arising from non- wavelet approaches. 

We start briefly recalling the main ingredients of the method and refer 
to [10] for further details. Note that the theory outlined in Section 2 does 
not apply directly because we are dealing now with a non-periodic problem 
and the boundary of the considered polyhedral domain is not smooth. Nev- 
ertheless, the decay of the matrix entries in the wavelet representation on 
a face of the polyhedron [10] is of the same form as for the periodic case 
[7]. Accordingly, the results are also similar to those predicted in the model 
problem. 

It is a well known fact that the Dirichlet problem for the Laplace equation 
in some domain 'P C can be transformed into a second kind integral 
equation over its boundary [15]. For a polyhedral domain V this equation 
over the boundary fi = dV reads 
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Au:={I + 2W)u = f, (4.1) 

Wu{x) ■.= [1/2 - en{x)]u{x) + ^ £ !^y-k^u{y)dyf2, 

where 0a(x) is the inner solid angle of i? at x € i7 and Uy the unit vector 
of the interior normal to P at 2/- VF is the double layer potential operator. 
The kernel function k{x, y) := ^riy '{x-y)\y - x\~^ vanishes if x and y are 
located on the same face of O. 




Fig. 4.1. Triangle [xki^xk 2 ^xk 3 ] of a faee of a polyhedral boundary and its 
refinement. {Ki,K 2 ,Ks} C {Ki, K 2 , K{, K! 2 , K'^} C 

{K[,K!2,K's}cA^. 



4.1 Triangulation and discretization 

As mesh for the boundary i? a regular triangulation is used. Given some 
initial partition (mesh 17® ) of 17 into triangles, subsequent subdivision of 
each triangle of 17® into four congruent subtriangles leads to 17^. This process 
generates a sequence of nested meshes 17^ of depth or level j G No . Clearly, 
the meshsize h on the level j is proportional to 2“^ . The vertices xk of the 
triangles are the knot points of the mesh. The indices A" on a given level 
define a grid = {K : xk is a knot in 17-^}. Again, the set of additional 
knots added when passing from level j — 1 to level j is 

As trial functions, we employ properly normalized Courant hat func- 
tions (pj^K which are frequently used in finite element or boundary element 
methods. More precisely, each basis function is a continuous function 
on 17 whose restriction to any triangle r in 17^ is affine and satisfies the 
nodal conditions p>j^K{xK>) = for all K^K' G The spaces 

Vj span {^j^K K £ of piecewise linear continuous functions on 

17 are by construction nested ko C Vi C • • • C Vj-i C Vj. By considering 
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the restriction to the planar faces of V one readily confirms second order 
accuracy of the trial spaces. 

Consider knot collocation on the finest grid , i.e., we wish to determine 
a piecewise linear and continuous function E Vj such that 

Au'^^{xk) = /(xk), if e (4.2) 

Thus the entries of the collocation stiffness matrix are given by 

iA(pj,K'){xK) = f Ti^-¥>j,K'iy)dyf2, (4.3) 

2^ JsuppiPj K, \y ~ ^k\ 

K,K' eV^,K ^ K'. 

suppipj^K' consists of the six triangles r, for which knot K' is a common 
vertex. In order to get a fully discrete method quadrature which is known to 
be exact for polynomials p of degree at most two is used for approximating 
the integrals (e.g. [22]). To describe this, let r = [xKiy^K 2 ^^Ks] ^ and 
let xk[^xk’^,xk'^ denote the midpoints of its edges [xki,xk 2 ]j [xk 2 ^^K 3 ]i 
[xk 3 ,xki]^ respectively (Fig. 4.1). The quadrature formula for a function 
v{x) then reads 

/t;(x)dx ~ l^t^(xA:')|r|. (4.4) 

r *=i 

Moreover, to treat the singularities in a proper way, usually a regularization 
technique is employed (cf. [13]) based on rewriting Au = f as 

2u{x) + -^ -u(x)]dy = f{x), xef2. (4.5) 

Note that some calculations presented in Tables 4.1 and 4.2 are performed 
without singularity subtraction. 

In summary one obtains the following linear system with the discrete 
collocation matrix 



AM=f: 



3 ^ 






(4.6) 



4.2 Decomposition of Function Spaces 

Our approach is based on a multiscale decomposition of the given trial spaces 
induced by the following two scale relations. The refinement equation (2.1) 
now takes the form 

K € V-^. (4.7) 

The coefficients rrij^K'.K are called mask or filter coefficients. On those grid 
points located in the interior of the triangles r e the entries of the sparse 
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Nj^i xNj matrix can be represented by the following 

7 point stencil 

0 1 1 \ 

12 1. (4.8) 

110/ 

Here the bold value indicates a position K in the coarse grid V^ . Relation 
(2.3) which complements the basis in Vj to one for Vj+i, now reads 

V-i.K- = (4.9) 

if'evj+i 



and in the interior of each r G cj^k',k corresponds to one of the following 
3 point stencils 



/ 0 0 0 

- 12-1 
\ 0 0 0 



1 

2 ’ 



0-10 
0 2 0 
0-10 



1 

2 ’ 



0 0 -l\j 

0 2 0 -, (4.10) 

-10 0 / ^ 



depending on the direction of the edge containing the knot xk^ K G A^. Here 
the boldface values correspond to indices K £ AK 

Now let as before Tj denote the transformation that takes the coefficient 
vector Uj of some element of Vj relative to the multiscale basis {(fo,K • ^ ^ 
V^} U K £ A\ I = 0, . . . ,j — 1} into the coefficient vector relative 

to the nodal basis {^j,K : K £V^}. The linear system (4.6) is equivalent to 

Ajuj^T;fj, fj = f{xK),K£V^, 

where Aj := TjA^jTj is the collocation stiffness matrix relative to the multi- 
scale basis. 

The transformation Tj has the same pyramid structure as described in 
Section 3. Due to the finite supports of the masks its application requires the 
order of 0{Nj) operations. 

The essential steps of the present preliminary version of the algorithm 
may now be described as follows: 

1. The discretized collocation method in the nodal basis gives rise to a linear 

system (4.6). The complexity of assembling the matrix is 0{Nj). 

2. The transformations Tj^Tj lead to the matrix Aj := TjA^Tj, so that 

the system is equivalent to Ajuj = Tjfj, and uJ = Tjuj. Since AJ is 
fully populated the complexity of this part is still 0{Nj). In view of the 
order of our quadrature rule, the estimates in [10] predict a significantly 
faster decay of the entries in Aj than of those in AJ. Replacing all those 
entries in Aj by zero whose modulus stays below a given threshold th 
yields the compressed system AjUj = Tjfj. To improve the scheme one 
has to compute Aj directly without going through AJ. The theoretical 
concepts needed to perform this step in an efficient way have now become 
available [21]. 
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3. This latter system is solved with the aid of a sparse iterative solver (e.g. 
GMRES) without preconditioning since the order of the operator is zero 
in this case. 

The decay estimates (see [10]) allow us to estimate the deviation of the 
solution of the compressed system from the solution to the full system and, 
in some cases, also from the exact solution of Au = /. As pointed out above 
at this stage this method has still an overall complexity of 0{Nj). Only the 
last step, after the thresholding procedure, has a complexity which is nearly 
linear and therefore leads to a fast solution of the linear system. As one can 
see in Tables 4.1 and 4.2, quadrupling the number Nj of unknowns leads to 
quadrupled nze only. In contrast to the method of Section 3 it does not save 
yet memory because in the first two steps, AT? matrix entries are handled. 
Nevertheless, so far the primary objective of these test has been to confirm 
the performance of such techniques with regard to compression and conver- 
gence behavior in situations where not all the assumptions of the model case 
are fulfilled. The results show that efficient compression is possible without 
deteriorating the resulting accuracy of the discrete solutions significantly. 

4.3 Numerical examples 

For our numerical experiments we consider a Dirichlet problem for Laplace’s 
equation AU{x) = 0,x eV, and smooth Dirichlet data, U{x)\n = f{x), x — 
(i^j 3 ^) ^ In spite of the smoothness of the Dirichlet data the solution 
will generally not be smooth because the boundary is not smooth. 

Here we mainly present numerical experiments for two different polyhedra, 
a cube, and a pyramid. For a nonconvex domain (bench) and further more 
detailed results, see [10]. 

Recall that the solution U of the above Dirichlet problem with boundary 
data f{x), X e f2 has the representation 

where u is the solution of the double layer potential equation Au = /. Specif- 
ically, for our tests we choose f{x) := U{x)\f} with the harmonic function 

~ y^((ia: + l)^ + 2a;^ +3x‘^)~i,x eV. 

In order to control our compression error, we have to monitor the error 
between the exact solution u, and the approximate solution uj obtained by 
the given (uncompressed) discretized collocation method on a grid with 
N = Nj knots. Since we are often interested in the potential [/, or in some 
functionals of the boundary data u, rather than in the values of u, we directly 
go ahead and determine first an approximation of U as follows. Inserting uj 
into the representation formula above and computing the integral via the 
same quadrature rule used for the computation of the entries of the stiffness 
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matrix, we derive the following formula for the approximate solution of the 
boundary value problem (cf. Fig.4.1): 



Uf{x) := 






(a; - y'li 



4w ^ 3^ 

T^OJ t=l 






(4.12) 



where for r = [yKt,yK 2 ^yKz] the points are defined by y^, = + 

yKk) ^ = 1j 2,3 with A: = i + 1, for i = 1, 2 and with fc = 1 if 

i = 3, are the midpoints of the edges of the triangle r and uj{y^,) := 

As a further control we replace the exact solution u of (4.5) by the numer- 
ical solution uj on a very fine grid and compute ERRj = nln where 

here ||ti||o := i'^KeOj This error is equivalent to the relative 

-error (see [10]). 

In the multiscale representation Aj of the operator we discard those ele- 
ments which are below a given threshold th and end up with a compressed 
or sparsified matrix Aj. The ratio between the total number N‘^ of matrix 
elements of the full matrix and the number nze of nonzero elements after 
thresholding defines the compression rate= cpr By compression we 

obtain a perturbed system. The solution of this perturbed system yields the 
approximate solution Uj (which at this stage is given in its multiscale rep- 
resentation). Evaluating uj at the points and substituting these values 
into the discrete representation formula (4.12) for provides Uj{x). The 
evaluation is most efficiently performed by transforming the solution of the 
compressed system into nodal basis coefficients with the aid of Tj and ex- 
ploiting then the localness of the basis functions 

Of course, the compression causes an additional error. An acceptable com- 
pression should have only a negligable influence on the precision of the final 
approximate solution. To monitor uj we again replace the exact solution 
u of (4.5) by the numerical solution uj on a very fine level J and com- 
pute ERRj = . In order to determine which threshold results in 

a suitable compression, we compare this error with the error ERRj of the 
discretization scheme. Similarly we compare ERR^.* with the correspond- 
ing error \U{x'^) — Uj{x*')\. To estimate this error we compute the maxi- 
mum MERRa; maxj=i^ 2 , 3 |t^(^*) — Uj{x'^)\. This is the second quantity 
that should help determining which compression rate still has a negligable 
effect on the precision of the approximate solution. Finally, we compute the 

additional error arising from the compression ERR := ? which is also 

displayed in most of the tables. 

The main results for the geometries we have tested are presented in Tables 
4.1, and 4.2. The bold quantities refer to the largest threshold th and the 
corresponding solution uj for which ERRj « ERR^ on the same level j. In 
this case, thresholding apparently has a negligable influence on the solution 
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Uj. Furthermore for the bold quantities one has MERR* « ERR^.* , z = 1, 2, 3 
and for j > 4 even MERRa. < ERRa.i,i = 1,2,3. We observe that we can 
choose a larger threshold, and this results in a better compression for larger 
AT, satisfying MERR^; « infiERRa,* , i = 1,2,3. 



Table 4.1. Number nze of nonzero elements, compression rate cpr, errors ERR 
and MERRa; for the multiscale algorithm for the Laplace equation on the cube for 
several thresholds th and levels 



Cube 1 


j 




th 


nze 


cpr 


1 ERR 


MERRx 1 


1 


26 


3- 


QH 
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1.2 
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1 • 
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3.0 • 10"^ 




98 


1- 
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3 
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3- 


OBI 


36,484 


IHBII 
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mu 
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B 
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1 

o 


BS 






3- 


H 


12,646 




1 2.3 10-2 


BlB 


■1 
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401,272 
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3- 


HiBl 
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10.3 


IBS 


BS 


■ 




1- 


EES 


125,524 


18.8 


KIlB 


BEm bi 


5 


6,146 


3- 


inBi 


2,098,108 


18.0 


mu 


9.2.10"'' 1 






1 • 


l^D 


1,118,296 


33.8 


BS 


EBiBiiBfl 






3- 


BB 


647,032 


58.4 


bbB 


2.4 • 10"® 






3- 


QBI 


6,755,512 


89.4 


- 


■HtiBlihM 






1 . 


BB 


3,953,114 


152.8 


- 


IfllTB 






3- 


BB 


2,478,484 


243.7 


- 


BbbI 


s 


98,306 


3- 


inn 


17,857,936 


541.0 


- 


renmBi 


■■ 




3- 


BB 


7,097,044 


1361.0 


- 


Bmmi 



^ Calculations without singularity subtraction. Coarsest level is set to I — 2, 
^ Calculations without singularity subtraction. Coarsest level is set to / = 3. 



In Figs. 4.2, and 4.3 the number nze of nonzero elements of the compressed 
matrices Ap for which we observed acceptable precision (bold quantities in 
Tables 4.1, and 4.2) versus the number of knot points N = are plotted 
and compared with the elements of the dense matrix. Figure 4.3 exhibits 
a nearly linear increase of nze for greater N. The compression reduces stor- 
age significantly to 1/cpr • iV^. Likewise the CPU time for the matrix- vector 
multiplications during the iterative solution of the linear system decreases 
substantially (see Fig.4.4). 

For the solution of the discrete and compressed scheme we again use the 
iterative method GMRES. For the estimation of the condition numbers K(>lp 
and we apply a direct solver. This method is an expert driver of the 

well known LA-package [12]. The number of iteration steps, the estimated 
condition numbers and the CPU time tcM for the GMRES solution on a 
DEC 3000 AXP 500 workstation are presented in Table 4.3. The termination 
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bound for the iteration process is chosen to be about ERRj/100. Note that 
since the operator has order zero in this case and since we have not employed 
an orthogonal basis the condition of the original matrix should be better 
than that of the transformed matrix. Furthermore, compression should have 
a minor influence on the condition numbers which is also confirmed by our 
experiments. 



Table 4.2. Number nze of nonzero elements, compression rate cpr, errors ERR 
and MBRRa; for the multiscale algorithm for the Laplace equation on the pyramid 
for several thresholds th and levels j. 
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^ Calculations without singularity subtraction. Coarsest level is set to I = 2. 
^ Calculations without singularity subtraction. Coarsest level is set to / = 3. 



Our aim was to show the nearly linear asymptotic behavior of the nec- 
essary number of nonzero elements nze also in the case of a large number 
of degrees of freedom. So quadrupling the number Nj of unknowns leads to 
quadrupled nze only. In spite of a relatively strong compression one observes 
acceptable accuracy. For N « 100, 000 such a compression rate is about 500 
and we observed this rate to be more or less independent of the underlying 
geometry. Therefore in CPU-time solving the linear system by an iterative 
scheme the speed up factor was dramatic - solving a system corresponding 
to 100, 000 degrees of freedom, only takes 3 minutes on a sequential work- 
station and the whole matrix goes into a main memory of « 200 MB. Our 
method damps the coefficients away from the singularity. Thus the double 
layer potential operator for the Laplacian can be well compressed. We expect 
the same to persist for the double layer potential for the Stokes system. 
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Table 4.3. Laplace equation: Number of iterations it for the solution of the linear 
systems = fj and AjUj = respectively, with GMRES; CPU- times t 

in seconds on a DEC 3000 AXP 500 a-processor workstation. Condition numbers 
K of the corresponding matrices. 
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^ Calculations without singularity subtraction. Coarsest level is set to / = 2. 
^ Calculations without singularity subtraction. Coarsest level is set to / = 3. 



One advantage of the present multiscale technique applied to boundary 
integral equations is that we have additionally a simple a posteriori crite- 
rion to decide which coefficients are essentially required. Applying the present 
thresholding leads often to much better compression than the a priori 
choice of coefficients. This seems to apply to our double layer potential op- 
erator. 
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Fig. 4.2. Number nze of nonzero elements of the transformed matrix Aj for the 
Laplace equation after thresholding in dependence of Nj . Dotted line for 
comparison: quadratic behavior of the number of elements of the original stiffness 

matrix. 
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Fig. 4.3. Number nze of nonzero elements of the transformed matrix Aj for the 
Laplace equation after thresholding in dependence of Nj in logarithmic scale. 
Dotted lines for comparison: quadratic behavior and linear behavior. 
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Fig. 4.4. CPU- time in seconds on a DEC 3000 AXP 500 a-processor workstation 
for the iterative solution of the linear systems for the Laplace equation with 
uncompressed (dotted line) and compressed matrix, respectively, with GMRES. 
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5 Concluding Remarks 



We have outlined some theoretical foundations of multiscale methods and 
have carried out corresponding numerical experiments for two types of bound- 
ary integral equations. Galerkin methods being better understood we have 
concentrated here on collocation. The first case study concerning the Helm- 
holtz equation is covered by the existing theory and shows that aside from 
the natural effects of large wave numbers also operators of order different 
from zero can be handled efficiently. The experiments are of significant help 
in clarifying the quantitative effects of the various ingredients of the scheme. 
The second example is a more complex three dimensional problem which is 
not fully covered by the analysis of the model problems. It could be shown 
though that nevertheless even for large problem sizes the basic compression 
and convergence properties predicted by the theory for the periodic case still 
persist to be valid. 
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Galerkin-Type Boundary Element Analysis for 
3D Elasticity Problems 

H. Andra and E. Schnack 



1 Introduction 

In this paper, we introduce a formulation that is based on the Galerkin- 
type approach, leading to double integrations. In this formulation, the use of 
both singular and hypersingular boundary integral equation leads to a fully 
symmetric-definite equation system, a property of considerable advantage. 
The GALERKiN-type approximation of the hypersingular boundary integral 
equation does not lead to complicated point functionals like in the colloca- 
tion method and is well suitable for non-overlapping domain-decomposition 
algorithms [4]. 

The requirement of performing double integrations of (hyper-) singular kernel 
functions has limited the implementations of symmetric Galerkin boundary 
element method (GBEM) to those involving straight elements [11, 17, 18], to 
two-dimensional problems [11, 10], or to only potential and similar problems 
in three dimensions [21, 6]. 

Since analytical integration is not possible for general curved elements, one 
has to rely on numerical integration; but due to the singularities in the kernels 
ordinary numerical quadrature schemes fail to produce an accurate solution or 
need thousands of integration points. Many proposed methods approximate 
the curved elements by fiat elements of approximately the same proportions 
tangential to the curved element at a projection point (see e.g. [8, 21, 6, 5]). 
Then integration is performed with respect to this special flat element or 
further mappings of the flat elements are used. Whereas, in this work the 
numerical quadrature is referred to a standard reference element like in the 
isoparametric finite element method. 

In Section 2 we formulate the boundary integral equations for the mixed 
boundary value problem in linear elasticity and introduce the Galerkin 
scheme. Section 3 is devoted to the integration method for the singular and 
regular integrals. By using special variable substitutions we obtain integrands, 
which have no singularities and can be treated numerically with standard 
Gaussian quadrature. We give estimates of the quadrature errors by ex- 
tending the estimation method of Schwab/ Wendland [23] from the two- 
dimensional case occuring in the collocation BEA to the four-dimensional 
case (see Theorem 3.2). In Section 4 we present some numerical tests. 
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2 Boundary integral equations in elasticity and the 
Galerkin boundary element analysis 

2.1 Boundary value problem 

We consider a solid body or structure occupying the (finite, simply connected, 
open) domain fi C , with a Lipschitz boundary surface T = Ti U T 2 , 
A n T 2 = 0, mes Pi > 0, with given Dirichlet data on Ti, and Neumann 
data on P 2 . The body is referred to a Cartesian orthogonal coordinate system 
X = (xi,X 2 ,X 3 ). We present a solution procedure via a system of boundary 
integral equations for the mixed boundary value problem of linear elasticity: 

Problem 1. (Boundary Value Problem) 

For given displacements u* on Pi and given tractions t* on P 2 find the dis- 
placements u — (ui,U2,U3) in satisfying the Navier differential equations 

A*u = fxAu 4- (A -b /i) grad div u = 0, Vx G i?, 

with mixed boundary conditions 

Ui=ul, VarGA, 

3 

ti = ^ ^ = tj ^ , Vx G i~2 2, 3), 

where aij is the Cauchy stress tensor, ti are the boundary tractions, uj is 
the unit outward normal, and p> 0,X> -|// are harness elastic constants. 

2.2 Boundary integral equations 

For the sake of simplicity we have restricted ourselves to the case of non body 
forces. 

The operator A* has the fundamental solution U (see [16, 3]) with 
Uij{x,y -x)= [(3 - 4v)6ij+r,irj] {i,j = 1,2,3), (2.3) 

where n = j/i - Xi,r = ^(eLi ^- ) ^ = ^- 

We obtain fundamental solutions with stronger singularities from (2.3) by 
taking certain derivatives. We apply the stress operator [3] on U{y,x) with 
respect to y to obtain the strongly singular fundamental solutions 

3 

~ x) = ^fej(Vy)([^i/]f — 1 {y i ^))nk{y) {i-tj = 1,2,3), (2.4) 

k=\ 



( 2 . 1 ) 

( 2 . 2 ) 
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where n{y) = {ni{y))i^i^s is an arbitrary unit vector. If we apply the stress 
operator once more with respect to x, we obtain the hypersingular funda- 
mental solution 

3 

^ki{^ x){[Tij]i— I {y^ x))rik{x) {i^j = 1,2,3). (2.5) 

k=l 

Furthermore, we introduce the trace mappings 

i>i = fo'P ■= V'Iri , <Pi = 7iV := [<^iji^)j'j\n]Ui 

Now the boundary integral operators, which are associated with the funda- 
mental solutions (2.3), (2.4), and (2.5), are defined by 

■= 27o,* lo,yU(x,y - x)ipi(y)ds{y), 

KijVi 2 v.p. 7^ / j^o^yT{x,y- x)(pi{y)ds{y), 

JTi 

Klj'ipi := 2 v.p. 7^_j, jl yT^{y,x- y)^i{y)ds{y), 

Dij(pi ~ -2 p.f. 7^ / 7^ 5(x,y - x)(pi{y)ds{y), 

where Be{x) = {y\ |y - x| < 6:(e > 0)} is the e-ball about x G . Kij, K[j 
are defined by Cauchy singular integrals, and Dij is defined by a hypersin- 
gular finite part integral in the sense of Hadamard. The weak solution of 
the boundary value problem (Problem 1) with u E (i?) := is 

equivalent to finding a solution of the following problem: 



Problem 2 (Boundary Integral Equation). 

Let he given {^l) E H2 (A) x IT2 (/^^). 

Find a boundary function (^^^) E Vr '= Hl2(r2) x H 2(Ti) satisfying the 
system of boundary integral equation 



D 22 V 2 + K[^ti = -Dv 2 U* + (/ - on T2, (2.6) 

- ^21^2 + Vnh = (/ + Ku)u^ - Vvit^ on A (2.7) 



in Vf (in the weak sense), i. e. 
K D 22 K[,\ (V2\ f^2\\ 

\[-Kn ViJUJ’UJ/ 



-Dn (^A\ 

I + Ku -V12 ) \t*)'\xpi)/ 



V(£) e Vr. 



Under these assumptions, there exists a unique solution of Problem 2 [25]. 
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2.3 Galerkin’s Method for the Boundary Integral Equation 

Now we formulate the Galerkin’s method for this boundary integral equa- 
tion. We specify the conditions on the boundary F and its triangulations in a 
standard manner (see [23]). Let the boundary be a closed strong Lipschitz 
surface and let F be partitioned into finitely many pieces Si with F = Si 
where each of the 5/ is the image of an analytic application y = {rj) de- 
fined on a polygonal domain F; E . The are supposed to have analytic 
continuations beyond Vi . The triangulations on F are obtained as images of 
regular quasiuniform triangulations of the parameter region V/. All bound- 
ary elements of one surface piece Si can be obtained by the composition of 
F(^) and one linear mapping together with translations a{ applied to a 
reference element 



7T = { 2 : € E^ : 0 < < 1,0 < 2^2 < zi}. 

Then 

® + ai) 

defines the in general curved elements of Si. Each element 7rf belongs to one 
and only one surface piece Si. Thus we omit the index I at Trf = The 

approximating boundary element trial spaces are now defined by lifting finite 
element functions from the parameter partitions of Vi onto F via • As 




Fig. 2.1. Mapping of the reference element onto the original element 



approximating subspace Vh of Vr we take piecewise polynomials of different 
degree for the approximation of V 2 or t\. For the definition of the nodal 
basis of Vh we select on F a number of nodes, : N = l,--,n^}, 

for approximating the boundary tractions and a second number of nodes, 
: N = 1, • • • ,n#}, for approximating the boundary displacements and 
assume that the first (n^,i) nodes belong to F\. Suppose that there is 
a (scalar) standard local basis function (^^*^) associated with each node 
x^ (x^^). Since the transformation is invertible, local basis functions on 
TTi can be defined by 
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O = ${i) o (x^)-!, \/x £ 7r<. 



Thus the boundary element bases are defined by 






1^(1) 

#(i) 

1^(1) 






1 = 



^(^#,1+1) \ 
^(^#,1+1) I ^ 

^(n^ i + l) J 



and 



Vh = span ^ X span ^ 



is taken as approximating subspace. 



Problem 3 (Galerkin Scheme (exact integration)). 

Find ^ Vh such that 



$(^#) \ " 

I 

J 



D 22 K[^ 

-K21 Vn 

-D\2 
I + Kn 





(2.8) 



Lemma 2.1. [26] 

For the exact solution (^^) € M^{F 2 ) x H^“^(ri) (a < 1) the Galerkin 
approximation with exact integration (^^) € Vh satisfies the error estimate 




\w{r2)xw-^(ri) 



< ch^-^ 




l]l^(r2)x]H— 1(A) 



if t < I, —1 < t < s < a. The highest order of convergence is ch^'^^. 



Next we write down the Galerkin scheme in the form of a linear system 
of algebraic equations for the unknown coefficients (^^) in the spline basis 
(where Xi is the vector of nodal tractions and X 2 is the vector of nodal 
displacements) 



{ All 


-Ai 2 \ 


( 




V A21 


A22 J 


V A2 


) 



)^( 



Fi 

F 2 



We note that An and A 22 are both symmetric and positive definite and that 
Ai 2 = A 21 [19]. This system can be solved by elimination of Xi to build 
the Schur complement of the system matrix. The resulting reduced matrix is 
symmetric and positive definite. 
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3 Integration of singular and regular integrals 



One drawback of the symmetric Galerkin BEM is the necessity of cal- 
culating weak singular, singular and hypersingular integrals. However, the 
combined analytical and numerical integration of such integrals removes this 
drawback (see, e. g., [10, 21, 6]). The combined numerical and analytical in- 
tegration method for four-dimensional Cauchy singular integrals is based on 
the fundamental work from Sauter [21]. The introduction of relative coor- 
dinates with respect to the original coordinate system leads to complicated 
integration bounds and a lot of technical effort (see [21,6]). Therefore, the im- 
plementation of the proposed method for the direct computation of Cauchy 
singular integrals, which seems to be more efficient than other methods based 
on regularizations [11, 14], which lead to weak singular integrals, is for the 
kernels of elasticity rather expensive. These disadvantages are avoided in this 
work by introducing relative coordinates in the parameter plane for the ref- 
erence element. 

For generating the Galerkin system, the following finite part integrals 
have to be evaluated 



{^22^h 2 (r2)xH2(r2) 

= / P-f- / ds(j/) ds(x). 

Jr2 Jr^ 



Remark 3.1. The shape functions have to be linear functions or poly- 

nomials of higher degree. 

This integral can be regularized by using the identity 

p.f. ^ S{x,y — x) ds{y) = 0 Va; G T. 

The result is a sum of integrals with a reduced order of singularity 



[ p-f- [ S{y,x)(p^^\y) ds{y) ds{x) (3.1) 

JTi JTi 

= [ P-f- [ S(y,x)[ipf\y) -ipf\x)] ds(y) ds{x) 

Jr 2 Jr 

/ m ds{y) ds{x) 

•/supp Jsut>v> 



f ds{y) ds{x). 

Jsupp (pj^^nsupp ^ Jr\s\ipp 



f'h ' 'i'h Jr\supp (fil ‘ 

Therefore, we have to compute integrals of the type 



I = [ p.f. I^{x)ds{x), 



(3.2) 
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where 




kn{x,y - ^(x),(x^^^) H!/))ds(y) 



and 

|a|>max{0,n} 



(3.3) 

(3.4) 



where 



^0 



denote the derivatives with the multi- 



index a e with respect to the second argument of and Be{x) := {y : 
1^ — x| < £:} for e > 0. For n = 1 (hypersingular operator) the regularization 
(3.1) leads to |a| > 1 in (3.4). We denote by kn{x^y - x) a component of 
the kernel functions U (n = -1),T (n = 0), or 5 (n = 1) with the order of 
singularity -2 - n. The components kn of the kernel functions U,T,S (and 
also other kernel functions, which arise from any integral equation formulation 
of a strongly elliptic boundary value problem) can be written in the form 



f^max 

kn{x,y -x) = E E c«(a;)(y - a:)“, (3.5) 

«^=0 

where 2 -h n 4- ac is an odd number and 

3 

={y-x)-{y-x) = E^yi ~ 

i=\ 

The coefficients Ca{x) are supposed to be piecewise analytic. The shape func- 
tions ^ in (3.3) are products of elements of the matrices Ni^N‘ 2 . The cases of 




Fig. 3.1. 1 : coincident, 2: edge-adj., 3: vertex-adj., 4: regular case 



(1) coincident elements, (2) edge-adjacent elements, and (3) vertex-adjacent 
elements are potentially singular (see Fig. 3.1). In all of these cases the inte- 
gration method for (3.2) consists of the following steps: 
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1. Introduction of relative coordinates. 

2. Representation of the integration domains such that the integration over 
the relative coordinates is the outer integration [21, 6]. 

3. Consideration of the union of symmetric integration domains and ex- 
ploitation of the parity conditions [12] kn{x,y — x) = (-l)^'^^A;n(x,x-y) 
of the kernels. 

4. Introduction of Duffy’s triangular coordinates u = for the re- 

maining weak singular integrals. 

5. Computation of the outer regular integral by using standard Gaussian 
quadrature rules. 

We describe these steps in detail for the case of coincident elements and give 
the results for the other singular cases in the next subsections. In the last 
subsection we describe the adaptive numerical quadrature in the regular case. 



3.1 Coincident elements 



First we describe the case of coincident elements, i. e. Trf = ttj . We suppose 
(for simplicity) that the mapping X = x^*’ = x^^^ is analytic. Then we have 

Xiiv) = E * = 1, 2, 3. (3.6) 

aeNl 

Let us now write the finite part of the inner integral in (3.2) in local coordi- 
nates for an arbitrary x G interior 

Lemma 3.1. If x ^ interior{ir^) , or ^ £ interior{Tt)j then 

p.f. I^ix) = p.f. J^iO = V.p. 4(0 < (7i|ln(dist(^,a7r))| +C 2 



holds, where 



eiO ■■= f 

Jn\ 



kni^,V - 

kni^,V-0 = I^n{xi0,xiv) -x(0). 



(3.7) 

(3.8) 



where J denotes the JACOBIAN determinant The positive constants C\,C 2 
do not depend on e. 



Proof In general, we have [23, 22, 13] 

p.f. I^{X) = p.f. + E Ca(a(£>2“^)(^.6- 

0<|a|<n 

If kn arises from any integral equation formulation of a strongly elliptic 
boundary value problem and x £ interior(7rf ), then c«(^) = 0 holds. We 
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(3.9) 



The transformed kernel kn is pseudo-homogeneous [13], i. e. for every N 






j=0 



(3.10) 



where 

l«N(C,t7-ai<C'jvr^-"-' (3.11) 

and are analytic 27 t periodic functions in 0, which satisfy for /c < —2 

and AC > 0 the compatibility conditions 

/ /k(^. cos“' e sin"= edB = 0 for | [“[ T ^ ^ o 

Jo I PI = — ^ — 2 II AC < —2. 

By using the parameterization r = ii(0) of the boundary dit with respect to 
the variable 0, we obtain 



n R{e) N 
j=o 



6rdrd9 



^ ^(jD5^^)(^,^)rl"l cos“i 6»sin 

|o;|>max{0,n} 

+ [ RN{^,V-OH^,v)dv 

JABcH) 

Y, f " fn-M,e) cos“' 0sin“= d (3.13) 

la|>max{0,n} ’ ^ 

f In jR(0) — In £: if |a| = n — J 

\ (|al — n -h otherwise 

U\BAa 



From (3.12), (3.11), it follows 



Ji‘(a<Ci|ln(^mm^fl(0))| + (72, 



(3.14) 



where the positive constants C\ and C 2 do not depend on e. 

Then lime_vo JeiO =• v.p. Jq (0 exists and is called Cauchy principal value 
and the assertion is proved. 
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Remark 3.2. In general there is an explicit dependence of strong (n = 0) and 
hypersingular (n = 1) integral operators upon the shape of ^-neighbourhoods 
used to define the finite part integrals, even if the integral exists as a Cauchy 
principal value. Kieser shows in [12] that for operators defined by boundary 
potential operators and a suflSciently smooth diffeomorphism x transforming 

= {?? e : Ih -^11 < f{^,d ),0 <e< 2 ir} 

onto 

BliO = {»? e M" : ||»? - ^11 < q{£, 0),0 <e<27T} 
for all 0 < e < eo with some £o > 0, where local polar coordinates in the 
parameter plane are used, one finds that 

p.f. f kn(^,V - 0 dr) = p.f. [ dr). 

By using (3.6) we obtain the expansion 

~kni^,V - a = E - 6“ (3.15) 

|a|>/c 

for the transformed kernel. 



Theorem 3.1. Let the following suppositions be fulfilled: 

1. 7T^ = TTj — x(^); where x is analytic, 

2. the kernel function kn is of type (3.5), and kn is defined by (3.7), 

3. the shape functions $ are piecewise polynomials, i. e. 

E (3.16) 

|a|>max{0,n} 

Then the singular integral (3.2) is equal to the following regular integral 
(which has bounded integrands) 



pl-wiW2 nil . 

+ / {Kn^)UMJ^i,))d^2d^r (3.17) 

J Wlil—Wo) Wi(l—W2) ^ ' 



U>2) d Wi(l—W2) 
pl-WiW2 P$l-\-Wt(w2-l) 

+ / / (Kn^) d42d^Awidwidw2 

Wi{l — W2) do ' 



-wi)li(i),«^iCj) (3.18) 

+{Kn^) +«;i(l -u; 2 )(j),«;i(jlj) 

+(X„#) ((1 - «;i)|i (IJ + a;i(l - u>2)(;), tni (7)) } 

x(l - w{fwilid^2diidwidw2, 
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where 

{Kn^){i,ri) := + l) + hi^ + V, + V,0 (3-19) 

Proof. By using Lemma 3.1 we can write the singular integral (3.2) in the 
form 

I = [ v.p. 

J it 

= f lira f ~ 0 H^,v)drid^ 

= lira f f r)-0 HC, v)dvd^ 

We introduce the relative coordinates u — rj - ^ to obtain 

I = liia[ [ kni^,u)i{^,^ + u)dud^, (3.20) 



where 



^iO = {w € 6 < Ui < 1 - ^1,-^2 < W2 < Wi + 6 - ^ 2 }, 

Be(C) = {weM2 :||w-C|| 2 <e}. 



Next we describe the integration domain, such that the integration over the 
relative coordinates u is the outer integration. (This step was performed 
for the original element in [21, 6], but the representation of the integration 
domains with respect to the parameter plane (reference elements) is much 
easier.) We decompose E x G(^) into the domains 

El = {-1 <Ui <t),Ui <U2 < 0,-Wi < 6 < < 6 < 6 - 

Es = (-1 <ui < 0,0 < U 2 < ui 1, Au < ^i < 1,0 < ^2 < 6 - 

E^ = {—1 < U\ < 0,-1 < U2 < Ui,—U2 < < 1, —U2 < ^2 < 

E 2 = {0 < Ui < 1,0 < U2 < Ui,0 < ^l < 1 — Ui,0 < ^2 < 

E 4 = {0 < U\ < l,Ui - 1 < U2 < 0, -U2 < < 1 - Ui, -U2 < ^2 < 6}? 

^6 — {0<Ui < l,Ui <U 2 < 1, Au <^1 < 1— Ui,0<^2 ~ Au } . 

{Au := U 2 — u\.) The Cauchy principal value, i. e. the limit in (3.20), 
exists only for the whole integration domain E xG{(), but does not exist for 
arbitrary parts of the integration domain. Nevertheless the limit 

lim lim f j 

e-^O e-^0 J J(^E2i-iUE2i)\B,(0) 



kn{^,u)3{^,^-l-u)d^du (i = 1,2,3) 
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exists because of the symmetry of the domain Ei and (i = 1, 2, 3). For 
proving this assertion we estimate the regular integral (s > 0): 






kni^, u)S{^, ^ + u) + kn(^ + u, + u, ()d^du. 



J JEii\B^{0) 

By taking the suppositions 1.-3. into account, we obtain for the integrand 
functions 









H^>0 + T,x>iP\^u), n<0 

^(^>0 + Ex>iPx^(^,u), n<0 

-P?\^,u) + Ea>2Pa^^(^> w)> n = 1 



(3.21) 



kni^,u)= ^ ^ Ca(x(0)(x(^ + W)-X(6)“. 

/C=0 

l^rrtax 

kn{^ + W, -U) = ^ ^ (Ca(x(^)) + 

/c=0 



(3.22) 



where 



PaH^>w) := ^3y(0«i«2 ^ (» = 1. -.5)- 



In the case n = 0, the summands 

5o = e-"-" E c«(x(^))[(x(^ + «) - x(^)“ + (x(6 - x(^ + y)rm, 0 

\a\=K 

= 0 

vanish because k is an odd number. In the case n = 1, the summands 

5i = E Ca(x(a)[(x(^ + «) - x(a)“ - 

|a|=K 

-(x(0 - x(^ + w))“]pf’(^.«) = 0 



vanish because k is an even number. Because of (3.21) and 

\m\<Ch-^Q (i = i,2), 

|(x(e + «) - x(a)“i < (C = const, C{Q,h)) 
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all of the remaining summands of are obviously weakly singular. By 
introducing the Duffy’s triangular coordinates u = ^ == 

(u.^u-T-i)) ^ we remove the weak singularity and 

obtain the regular integral (3.17) by summing up. Equation (3.19) follows 
from (3.17) by the following variable substitutions with respect to the three 
summands in (3.17) 

6 = (1 - 6 = (1 - 'w^i)|i|2; 

= (1 - Wi)ii W2), 6 = (1 - Wi)ii^2 +Wi{l- W 2 )] 

= (1 -u;i)^^i +Wi{l-W2), 6 = (1 - w^i)66. 

Now all assertions are proved. 

Lemma 3.2. Let the mapping x linear, and let the suppositions of Theo- 
rem 3. 1 be fulfilled. Then the integrands of (3.1 7) are polynomials in 
and the integrations of (3.17) can be performed fully analytically. 

Proof The shape functions <P(-, •) are obviously polynomials in W 2 - 

Therefore we have to consider kernel functions of the type kn{^ H- wi(,wirj). 
By using the linearity of x we obtain 

x(^ + (C + »?)) - xi^ + C) = m xiv) 



and 



= llx(€ + Wi(C + v)) - xi^ + W1OII2 = wlWxivWl =■■ wIq^{t]). 



Hence 



f^max 

kn{i + WiC , WiT }) = ^ 

Kr=0 






where Ca(.) is analytic, is a polynomial in ^ = (6?^2)^- Thus the integrands 
of (3.17) are polynomials in ^1,^2- In (3.17) the terms, which have nega- 
tive exponents of wi, vanish, because the integrands of (3.17) are regular 
(see Proof of Theorem 3.1), such that the integration over w\ can be 
performed analytically. The remaining outer integrals over W 2 are of the type 



/■ 



qn{w2) 

-h bw2 + 



rdw2 



(3.23) 



where qn is a polynomial of degree n and a, 6, c are constants. This integral 
(3.23) can be evaluated by standard variable substitutions [1]. 

Now we propose a Gaussian quadrature scheme suitable for integral 
(3.19). In and w\ we apply the standard error estimates for Gaussian 
quadrature formulae in terms of the derivatives of the integrand. The error 
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for the ^ 2 - and i(; 2 -integration will be estimated by using derivative-free esti- 
mates based on the domain of analytic continuation of the integrand, i. e. we 
use ideas of Schwab/ Wendland [23] and Rabinowitz / Richter [20]. 
But the application of derivative-free estimates in the case of four-dimensional 
Galerkin integrals seems to be new. 

We substitute = h~^Cu 6 = C 2 , wi = W 2 = (a in (3.19) 

to obtain h 1 h 1 

f f f f /(Cl,C2,C3,C4)dC2dCl^^C4dC3, 

^0 Jo Jo Jo 
where is independent of h. 

Theorem 3.2. Denote by Q^[f] = /(C/) ^he Gaussian quadra- 

ture formula of order N in ^ for the interval [0, ij. Then there exist constants 
(5>0 and C > 0 independent of h, such that 

(3.24) 

where I is defined in (3.17) or (3.19). 



Proof With denoting the integration with respect to C? the error E of 
numerical integration is 

\E[h'^f{hii,i2,hwi,W2]\ (3.25) 

We reduce the estimate of (3.1) to four one-dimensional estimates of the 
errors := etc., as usual: 

\E[h^f]\ 

— h \{Iw2 "I" ^u'2) ° i^wi d- Ewi) o (^I'j d- E^^) o 4 - E^^)[h‘^ f] — I[h‘^ f]\ 

4 

i=l 



where 



[/] = . sup 

^2,U»1,U»2€[0,1] 

Rsif] = sup 



\EM 



i? 2 [/] = , sup |E^-J/]|, 

|i,u»i,if;2€[0,l] 



R4[f]= sup \E^^Af]\. 



Ii,I^2,u»2G[o,i] |i,C2 5tyi€[o,i] 

The errors R\ , Rs can be estimated via the Taylor expansion by using the 
degree of exactness: 
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2N 



Rl[f]<CN^ ^ sup 

2N 

w i ^ y 



< Q \‘ - - 

— 1 f{h^iA 2 ,hwi,W 2 ) 



ii) 



f 



iLce(G) 



RsU] < 




(3.26) 

(3.27) 



where G = [0, h] x [0, 1] x [0, h] x [0, 1]. To estimate the terms /? 2 , we apply 
in this variable derivative-free error estimates (see [20]). Therefore we have 
to estimate ellipsoids of analyticity for /. Since # is a polynomial, it suffices 
to consider fc„. In local polar coordinates (3.9), the domain of analyticity of 

will be restricted by the zeros of and of 



l 2 { 0 ) = c{A cos^ 6 + 2 COS 7 COS0 sin0 -I- A * sin'^ 6 ) 



corresponding to the first fundamental form of the surface T with respect to 
^ or ^ + u. From (3.19) we have the following three cases correspondent to 

the three summands of / ; 



1 . = wlil + wl) = 0 =+ {W 2 )i ,2 = ±i 

1 ^( 6 ) = cw?r“2{A -H 2COS7W2 + A-^wl} = 0 (^ 2 ) 3 , 4 = -Ae=^*'^, 

2. = Wi(2w2 — 2w2 -I- 1) = 0 => (^ 2 ) 5 , 6 = 2 ^^ 

12 ( 6 ) = cwir~‘^{Xw 2 -f 2cos7iy2(w2 - 1) + X~^{w 2 - 1)^} = 0 

, / N _ l-l-Ae^--' 

^ \^2)7,S A^+2Acos7+1’ 

3. r ‘^ = u;?(l + wl ) = 0 =+ (^2)9, 10 =±i = (^2)1, 2 

12 ( 6 ) = cwlr~'^ {Xw 2 -i- 2 cos71i; 2 -I- A“^} = 0 (w;2)ii,i2 = -A“^e=‘=*'^. 

Next we map [0, 1] onto [—1, 1] via C = 2u)2 ~ 1- 

Cl, 2 = C9.10 = ± 2 i - 1 , C3.4 = - 2 Ae^*^ - 1 , Cs.e = ±*, 

±i(2Asin7)-yi^,. = 

'»7.8 A^ + 2Acos7 -I- 1 

In the same manner as in [23] we can construct ellipses Es» in the complex 
plane having foci ±1, such that C< 6 BEg^, where <5? is the sum of the semh 
major and semi-minor axis of Eg» ■ We choose 5 = mini 6 ^ , such that / is 
analytic in D [— 1, 1]- Then 

FUlf] < (27r)2c4e~*'^'“'^^^ sup |/|. 



(3.28) 
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The function / is analytic in I 2 , such that 

Hf] < (3.29) 

holds for any J > 1. The assertion follows from (3.26), (3.29), (3.27), and 
(3.28). 

Theorem 3.2 gives the orders N and M of the Gaussian quadrature to obtain 
an error of a prescribed consistent order of h. In order to satisfy the consistent 
estimate \E\ < Ch^ for given a > 0, one should choose [23] 

N>(a- 2)/2 and M > ^N. 

3.2 Edge-adjacent and vert ex- adjacent cases 

For the “weaker singular” edge-adjacent and vertex-adjacent cases a similar 
integration method is proposed. We give formulae with regular integrands for 
the computation of the singular integrals. In [21] similar results are obtained 
for polygonal bounded domains. 

Theorem 3.3. Let x £ and y £ tTj be two elements with adjacent edges, 
such that 

xW(0) = and x«((l)) = 

and let all other suppositions of Theorem 3.1 be fulfilled. Then the singular 
integral (3.2) is equal to the following regular integral (which has bounded 
integrands) 

+ (kn^){ri^‘^\^^^'>)'^d^idv2duidu2 (3.30) 

u^here f) = («^), ^ 

and (fc„#)(^,7j) ;= fc„(^,Tj - 0^iC,v) with xiO = xiv) = X^^H^) 

(see (3.7), (3.8)). 

Proof. By using the existence of p.f. Ig{x) Vx £ tt^ (see (3.3)) we have 

I = [ p.f. I^{x)ds{x) = lim [ p.f. I^{x)ds{x), (3.31) 

where Mx{e) := {x £ : dist(x, Onj) < e}. In a similar way as in the proof 

of Lemma 3.1 (see (3.13)), we obtain 

p.f. Ig(x) = Io(x) < Ci| ln(dist(x, Onj)] -f C 2 Vx ^ Trj*', 



where the positive constants C\,C 2 do not depend on e. Hence 




Galerkin BEA for 3D Elasticity Problems 237 



^ ^ -is?. [ ioi^)Mx), 

y7rf\M.(e) 

where the integrand Iq{x) is weak singular, such that 

^=^iS?.// {kn^)i^,r])dr]d^, (3.32) 

J J (nX7r)\M^(e) 

where 

•= € 7T X 7T : ^2 < e}. 

Now the goal is to remove the weak singularity by applying three-dimensional 
triangular coordinates. This requires to split up the integration domain into 
generalized pyramids. After decomposition of x tt in 

£^1 = {0<6<1, 0<e2<6, 0<m<6, (3.33) 

E 2 = {0 < 7^1 < 1, 0<rj2<T]u 0<^2<Vu 0<6<6}, (3.34) 

we introduce the relative coordinate ui = - rji for the integration over Ei . 

The domain of integration can be described hy E\ = EnU Eri, where 

Ell = {0 < ^2 < 1,0 < ui < ^2,0 < rj2 < ^2 - ui , ^2 < ii < 1}, 

^12 = {0 < ^2 < 1,0 < ui < 1, 

max{0,^2 -ui} <f }2 < 1 -uuUi 4-7^2 <6 < !}• 

are generalized pyramids. The variable substitutions wi = ui 4-772 in En and 
Wi =ui 772, W2 = Ui 4 - 772 - ^2 in Ei2 lead to 

' = ^st,{ I' c r £ (*”*) 

+ -K,d«„dmdvn}. 13.35) 

By using the three-dimensional triangular coordinates 

^2 = t;2,7x;i = V 2 ^^i ,^2 = V 2 U 1 U 2 for the first integral, 

wi = V 2 ,W 2 = V 2 U 2 ,t ]2 = V 2 U 1 for the second integral, 

we obtain 

Jo Jo Jo Jv 2 ^ ^ 

where the integrand is non-singular. Since the integration domain E 2 is of 
same type cts J^i, if we exchange ^ and 77, we finally obtain (3.3). 

If the mappings ^re linear, then the integrands of (3.3) are polynomi- 
als in and V 2 and the correspondent inner integrations can be performed 
analytically. 
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Theorem 3.4. Letir^ andiTj be two elements with one adjacent vertex, such 
that 

and let all other suppositions of Theorem 3.1 he fulfilled. Then the singular 
integral (3.2) is equal to the following regular integral (which has hounded 
integrands) 



/ = 




du\du2dusdu4, 



(3.36) 



where 




T] = 



' UlUs \ 
UIU 3 U 4 ) ’ 



(3.37) 



and {knS){^,rj) := kn(^,V ~ with xiO = xiv) = X^^Hv) 

(see (3. 7), (3. 8)). 



Proof Following the lines of the previous proof we also obtain in this case a 
weak singular integral (see (3.32,3.33,3.34)) 

I = \im [ [ {k„^)i^,T])dr}d^,M^{e) := € tt x # : < e}. 

J J (EiUE2)\M^(e) 

Now we consider E\ and introduce the generalized triangular coordinates 
(3.37) to obtain the first summand of the integrand in (3.36). Since Ei is 
of same type as Si, if we exchange ^ and rj, the second summand of the 
integrand follows in the same manner. 

The Gauss- Jacobi quadrature rule is suitable for the integrals (3.3), (3.36), 
since the integrands are of the type w^f{w) [9]. The error estimates are 
obtained in the same way as in the proof of Theorem 3.2 because of the use 
of Duffy’s triangular coordinates. 



3.3 Integration of regular integrals 



Let us briefly consider the regular case 4 (see Fig. 3.1). Using the notations 
(3.7), (3.8) we have to compute integrals of the type 



f f f f 0^n^)i^y‘n)dmdrfid^2d^i- 

Jo Jo Jo Jo 

By using the triangular coordinates = ^ 1,^2 = = '^i^V 2 = ^ 1^2 we 

obtain 



1 = 




dv2dv\du2du\. 



We approximate I by product formulas and use Gauss-Legendre quadra- 
ture rules with respect to U 2 ,V 2 and Gauss- Jacobi quadrature rules with 
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respect to It is easy to prove that if the one-dimensional rules have 

degree d, the product formula has also degree d [24]. The degree d, which is 
necessary to obtain a given error bound can be roughly evaluated by 
estimations minimal distance of the elements and the size of the 

elements H [15]: 



(d-f 2) 



H 






^ ^int* 



In the near field, subdivisions of the elements are necessary to avoid quadra- 
ture rules of very high order [15, 2]. Furthermore, integration methods for 
nearly singular surface integrals arising in the collocation method (see [7]) 
can be adapted to the Galerkin method. 



4 Numerical example 

The Leon problem (unbounded elastic domain, u = 0.3, with a spherical 
cavity under ID tension) is approximately analyzed using the domain i? (see 
Fig. 2). In all of the numerical examples, piecewise linear shape functions 




Fig* 4*1, Domain S7 



were used for the approximation of the displacements and piecewise constant 
shape functions for the tractions. In this case, some inner integrations can 
be performed analytically (see Table 4.1). Fig. 4 shows the approximated 
tractions (th)z in ^-direction within the plane z = 0. The correspondent 

error is e = = 0.0385. Table 4.2 shows the CPU time (SUN 

\\tz\\L2 

SPARCstation 10) of the matrix generation and the shares of the CPU time 
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Table 4.1. Number of analytical and numerical integrations 



Case 


analytical integration 


numerical integration 


coincident elements 


3 


1 


edge- adjacent 


2 


2 


vertex-adj acent 


1 


3 


regular 


0 


4 



Traction (si=0) 




Fig. 4.2. Tractions th,z within the plane z = 0 



for the evaluation of the integrand function in each case. The percentage of 
computational labour consumed by the singular integration decreases rapidly 
in comparison with the regular integration. 



Table 4.2. CPU time in sec. of the matrix generation 



Nodes 


Elements 


DOF 




coincid. 


edge 


vertex 


regular 


60 


116 


193 


9.64 


0.4 


2 


15 






202 


344 


13.94 


0.7 


4 


33 






464 


805 


9.64 


1.6 


11 


61 


KSH 


406 


808 


1419 


13.94 


2.8 


19 


117 


2073 i 



5 Conclusions 

The mixed boundary value problem in three-dimensional linear elasticity is 
solved via a system of boundary integral equations. Differing from the well- 
known collocation approach, the symmetric Galerkin BEM is used as ap- 
proximation procedure. 

The main result of this paper is a new general integration method for singular 
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double integrals, which can directly be applied to GALERKiN-type boundary 
element analysis of a wide range of BIEs on domains with curved (piecewise 
polynomial) boundary surfaces. Arbitrary polynomial shape functions can be 
considered. The singular double integrations for the symmetric Galerkin 
BEM can be computed by a semi-analytical method efficiently in the case of 
polygonal bounded domains. 
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Parallel Coupling of FEM and BEM for 3D 
Elasticity Problems 

Sz. Szikrai and E. Schnack 



1 Introduction 

In this paper we make use of the advantages of mixed variational FE/BE 
discretization and propose a parallel algorithm for solving the linear system 
of algebraic equations arise from discretizations of three dimensional linear 
elastostatic problems. The applied coupling technique was published in 1985 
by Schnack [9]. The discretization leads to a linear system of algebraic equa- 
tions, that can be reformulated in symmetric and indefinite form. This is very 
advantageous, because several efficient equation solvers can be applied. We 
can use Bramble/Pasciak’s conjugate gradient method [1], or we can derive 
a positive definite and symmetric system (with building the Schur comple- 
ment) and solve it with standard direct or iterative methods. Here, we solve 
the Schur complement system parallel, based on recursive substructuring and 
LU factorization. To decompose the mesh into subdomains, we use the sim- 
ple and efficient method of Farhat [4]. To obtain the BE Schur complement 
matrices, one has to solve a Neumann boundary value problem in each BE 
subdomain locally. The BE discretization results in a linear system, where 
the solution is increasingly sensitive to perturbations (e.g. roundoff errors) 
and one hcts to resort to special techniques. We use the singular value decom- 
position (SVD), that enables us to intelligently handle the problem and to 
compute the solution to the linear system. 

The parallelized algorithm presented is well suited for computations on 
MIMD computers with local memory and message passing principle (e.g. 
transputers). The communication architecture is a tree in the finite element 
domains and a hyper cube in each of the boundary element domains. 



2 The coupled 3D FE/BE variational formulation 

2.1 Basic equations of linear elasticity 

Let us consider a continuum in a bounded, three dimensional domain i? C 
with a Lipschitz-continuous, piecewise smooth boundary F = dft. We con- 
sider linear elastic materials, where the material properties are independent of 
position, i.e. the material is homogeneous, and the behavior of the material is 
the same in all directions, i.e. the material is isotropic. The boundary value 
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problem is formulated by Navier’s equation for the unknown displacement 
field Ui\ 

^ U = '^j,kk “b j ~ (^*^) 

where i/ is the Poisson’s ratio. Equation (2.1) is a second order partial dif- 
ferential equation of elliptic type. 

The solution of (2.1) has to fulfill the boundary conditions: 

- Dirichlet (or first type) boundary condition 

= 9h £ Fd = F\Fn, ( 2 . 2 ) 

— Neumann (or second type) boundary condition 

(Tij{u)nj = ti, Vx € Fn, (2.3) 

with the stress field a{j and the outward normal Uj. The Dirichlet and Neu- 
mann boundaries are denoted by Fd and Fjq^ respectively. With (2.1), (2.2) 
and (2.3) the mixed boundary value problem is completely defined. In the 
next section we reformulate (2.1) as a mixed domain decomposition varia- 
tional problem with domain and boundary integrals. 

2.2 DD variational formulation with non— overlapping 
substructures 

There exist many techniques and ideas to reformulate the boundary value 
problem (2.1) as a mixed domain decomposition coupled domain and bound- 
ary integral variational problem, cf. [3], [13], [8] and others. Here we follow 
the method described in [9], [12], [5]: 

Let us divide 17 into p non-overlapping, simply connected subdomains 



Hi (i = l,2,. 


..,p) called also substructures such that 




with 


17 = l7i? U Qb 

q p 


(2.4) 


where 


Qp ■= A 2tnd 17b := A, 

i=l 


(2.5) 




Hi U f2j = 0, Vi ^ j. 


(2.6) 



The FE-substructures are denoted with the first q indices, and the BE- 
substructures (the so-called macro-elements) with the p — q indices, respec- 
tively. Let us define some coupling boundaries (interfaces): 



— Global coupling boundary: 

p 

Fc := IJ Fi\r, with := dOi, i = 1, 



(2.7) 
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~ FE/BE-coupling boundary: 



ro:=dnFf]df2B. (2.8) 

— BE-boundary (skeleton of the macro-elements): 

p 

Fb - U Pi. (2.9) 

- FE-boundary: 

Ff := Fc\Fb. (2.10) 

Further on, we suppose that 

FBf]FN = 0 ( 2 . 11 ) 

for the simplicity. In the case that some macro-element boundaries contain 
parts of the Neumann boundary Fn, the proposed algorithm have to be 
slightly modified, see [5]. 

The energy test space corresponding to will be denoted by 

= {v€ HHOf) : = 0}. (2.12) 



On the skeleton we introduce the mortar functions 

u £ ^H^/^{Fb)^ :={w = u)|rB : w £ (//^(/2))^ , 

l|w||(jji/2(rg))3 := inf ||w||(Hi(fi))3}, 



(2.13) 



and the product space 

U= {(u,u) e (//i(12i.))" X (//i/2(rs))' , u|ro = uko) (2.14) 

equipped with the norm 

I|(v,u)||u := inf{||w||(Hi(fi))3 : w £ , w\np = v, w\fb = v}- 

(2.15) 

The test function space associated with U is denoted by 
Uo = {{u,u) G U : ^ = Mfb 



with G and t/;|rD — 0}. 

Let us introduce the trace spaces V and A on the skeleton: 



(2.16) 
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V = {i/ = : Vi e ^ ^ = g+l,...,p}, 



II 

II 

+ 


Ap) : Aj E 


[H-^/^iFi))" ,i = q+l,...,p} = 


= ylg+l X Ag+2 X 


... X A-p^ 


(2.17) 


With Ai = (^H~^/\ri)y , 




. . . ,p. Here, the subspace 



is defined as follows: 



J X e (H-^i\ri))^ ■. j xdr = o, j r xxdr = o 

V Fi Fi 




where r € denotes the distance vector to the origin. 

The coupled domain and boundary integral variational problem now 
reads: 

Find 



((u,u), A, i/)GV = UxylxV (2.18) 



with 



such that the global equations 

ap (u,v) + as i\v) = f{v), V(i;,e)EUo, 
the weak coupling conditions 

<u-Vi, T]i>ri = 0, e , q + i<i<p 

and the boundary integral equations 



SiVi = Ai, q-\-^ <i <P 

are satisfied. In (2.20) we used the following notations: 



aF{u,v) = XI / <^ki{u)eki{v) dn, 

*=i i 

P f 

aB{\,v) = XI j dr, 

«=9+l /. 

f{v) = I t V dr. 



(2.19) 

( 2 . 20 ) 
( 2 . 21 ) 
( 2 . 22 ) 

(2.23) 

(2.24) 

(2.25) 
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In equation (2.21) the duality product < . , . >ri- x 

(/f-i/2(/i))^ I — ). K was used as extension of the scalar product: 

<v ,\>r,= j v\dr, 1/ € , A e . ( 2 . 26 ) 

Fi 



Furthermore, in (2.22), Si : denotes the 

Steklov-Poincare operator, a Dirichlet-Neumann map, cf. [6]. It gives the 
relationship between the displacements i/* and the tractions on the bound- 
ary Pi of the i-th BE-substructure, which are related by the two boundary 
integral equations, cf. [6]: 



Fi — V{Xi + ^ 2 ^* 

Xi = f 2^* ^ 



or in matrix form: 



Xi 



- Ki 
Di 



Vi 

Ui + K'i 




(2.27) 

(2.28) 



(2.29) 



where Ci denotes the Calderon projector and Vi^Ki, K\ and Di are the four 
basic boundary integral operators: 

- Weak-singular operator: ViXi{x) = f U{x,^) Xi{^) dF^, 

Fi 

- Strong-singular operator: /^ii/f(a;) = / T{x,^) Ui{^) dF^, 

Fi 

- Transpose of the strong-singular operator: K'iAi(a:) = f T'(^,x)i/i(^)dF^, 

Fi 

- Hyper-singular operator: DiUi{x) = f S(x,^) Ui{^) dF^. 

Fi 

with the fundamental solution of Kelvin’s problem [/(x, ^) and the from U 
derived integral kernels T and S. The mapping properties on Sobolev spaces 
for these operators are well known (see e.g. [7]). 

From (2.27) 

Ai = V;-' (^/i + Ki)vi = Sm, ( 2 . 30 ) 

we get a Dirichlet-Neumann map, that defines the Steklov-Poincare operator 
on the trace space [H^/‘^{Fi))^ , see [6]. Alternatively, fi'om (2.27) and (2.28), 
one may also define the (symmetric) Steklov-Poincare operator as 



Ai = Dii^i + (i/i + K'i) y.-i (i/i + Ki^Ui = SiUi. ( 2 . 31 ) 
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The operator St is symmetric and -elliptic (i.e. positive def- 

inite) where 

:= I i/i e : Juidr = 0, Jrxuidr 

V Fi Fi 

Hence, Si admits a right-inverse Ui on , i.e. 

SiUi\ = \, VAe (2.33) 

and 

3 a,b : UiSiV - z/ = z/i^ = a-j-6xr, Vi/€ . (2.34) 

The operator Ui is the Neumann-Dirichlet mapping named Poincare- 
Steklov operator. It can also be expressed explicitly: 

Fi = (2^* ~ ^0 ~ (2.35) 

or 

= ViXi + - Kij Dri (i/i - K'i)Xi = U^i. (2.36) 

The existence and uniqueness of the solution of the variational problem 
(2.20)-(2.22) is proved in [5]. In order to solve the variational problem nu- 
merically, we discretize the subdomains and approximate the solution in a 
finite dimensional subspace Y/^ C V. In the next section we define the coupled 
FE/BE-discretization in the nodal basis. 




(2.32) 



2.3 FE/BEl— discretization 

First, we divide the subdomains fii {i = l,...,p) into finite elements, 
^ch _^at _^is discretization process results in a conform triangulation of 
n = i?irUi7^. Let us delete the FE-nodes and elements in the interior of the 
BE-subdomains i?* (i = g -h 1, ...,p), and let us form the BE>-discretization 
as a refinement of the FE-triangulization on the skeleton Fb- Figure 2.1 
shows an example, where tetrahedron finite elements (with quadratic basis 
functions to approximate the displacement field) and triangular boundary 
elements (with constant basis functions for displacements and for tractions 
as well) are used for the discretization. The tetrahedra generate the so-called 
coupling triangles (coarse grid with meshwidth H) on the boundaries of the 
BE-subdomains. The coupling triangles will then be decomposed into bound- 
ary elements (fine grid with meshwidth h). 
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Finite 




The displacement field u (finite element substructures) is approximated 
with piecewise polynomial (degree d>2) basis functions ^jj- 

= ^uU.F (2.37) 

with ^ 

» IkF = [M/ ■ (2.38) 

The vector Up is the usual finite element nodal displacement vector. Here 
and in the following the indices /, Cf and Cb mark quantities corresponding 
to the interior nodes of the FE^-sub domains and to the coupling boundaries 
Tfcj and Fbc respectively. 

The discretization of the traction field regarding the boundary elements 
is of the form: 

= span^A- C , q+i<i<p. (2.39) 

The displacement field Ui on is approximated (like the tractions) with 
piecewise polynomial (degree d' > 0) basis functions: 

= ^Vi Mvi. span^Vi c , i = q+l, (2.40) 

Here u^. denotes the nodal displacement vector. 
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The nodal FE/BE basis 



= [^u, 


(2.41) 


and the FE/BE subspace 


Y = Yh = VH X AhxVh 


(2.42) 


is now defined by the finite dimensional subspaces 


Uh = span#[/ C U, 


(2.43) 


Ah = span^yi C A, 


(2.44) 


Vh = span^v C V. 


(2.45) 



Once the basis 0 for V is chosen, we apply the Galerkin-method in the 
finite element subdomains and the 



— Galerkin method or 

— collocation method or 

— classical Neumann series 

in the boundary element subdomains to formulate the discretized version of 
the variational problem (2.20)-(2.22). This results in the system 

Ku = / (2.46) 

of coupled linear FE/BE-equations. 

Taking into account the arrangement of the unknowns given in (2.41), we 
can rewrite system (2.46) in the block form 

KjCf ^ICb ® \ / Uj 

KcpI KcfCb O UCf 

KcbI KcbCf Kcb O ILCb 

O O — T O S Uj^ 

O O O B -A J \ Uv 

By the elimination of Uy. in the BE subdomains we derive the following 
positive definite, but non-symmetric system: 




KiCf J^ICb ® 

K^CfI ^Cf KcfCb ® 

KcbI KcbCf Kcb 

O O -T H 



M/ 

MCf 

^B 

UA 



h 




(2.48) 



with 

H:=SA-^B = SL. (2.49) 

The quadratic form of the matrix H corresponds on the continuous level 
to the energy bilinear form, cf. [9]: 
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a{ui,Ui)- j (Tki{ui)eki{ui) dn 

Qi 

— j ^il'i dr 

Fi 

- u^. j Rb^ Nsi dr MVi (2.50) 

Fi 

= UAi SiUy. 

= U^. SiLiUj^. 

^ {Hi Ha,, Ha,), i = q + l,...,p, 

:u£{H^ (/2i))' ,«|-r< ^ (T,))' ,u\r^ = g}, 

: V £ (ri)^ ,J^ = Ui\r,}, 

: Ae (//-V2(r,))',A-V^-i (^ii + K^Ui}, 

i.e. the matrix H is symmetric and positive definite. The strict symmetry 
and definiteness can be achieved although only approximately on the dis- 
crete level. Approximation errors occur in the numerical computation of the 
displacement field Ui from the traction field A* by means of the boundary 
integral equation: 

Uy. = Li u^i > (2.51) 

where Li is the approximation of the Poincare-Steklov mapping Ui defined 
via the solution of (2.35) and/or (2.36) assuming that A^ is given. 

There are several possibilities to compute L^: 

— A) Galerkin approximation: 

In this case, we approximate the Poincare-Steklov operator Ui by via 
the Galerkin approximation of equation (2.35): 

Find i/f € Va = span^y. C (^HQ^^{ri)^ such that 

<ViXl,Xi>F,^<i\li + Ki)utXi>F„ VxfeVift. (2.52) 

— B) Collocation method: 

Here, the test functions Xi ^re the Dirac functions concentrated at suitable 
collocation points Pik ^ Li, fc = l,...,Mj = dimVih- We choose the BE- 
nodes as collocation nodes. The numerical problem to solve is as follows: 



Vuj G {u : Oi I — y 
E : Tj I — y 
Xi£{\:Ti^ R^ 
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(yi>^i)(Pik) = ^fi(pik) + {KiU^)^ik), k = l,...,Mi, (2.53) 

whose solution provides us with the mapping Af i/f. This again defines 
an approximate Poincare-Steklov operator: 

Af i/f = UgXt (2.54) 

— C) Neumann series: 

Since the spectral radius of - Ki) in the space orthogonal to rigid 
body motions, 

(i?o(^t)) •= {^i ^ {L/ 2 {ri))^ : J Vi dr = 0, J r X Vi dr = 0, \/r e Ri]. 

r n 

(2.55) 

is less than one, the classical Neumann series can be used to compute the 
inverse 

1 1 

i-h + Ki)-^ = (-/, - KiY (2.56) 

/=0 

which converges in [H§{ri)) ^ and provides an explicit tool to solve the local 
Neumann problems in the macro-elements, i.e. to compute the Poincare- 
Steklov operator 

Ui = {\ri + Ki)-Wi (2.57) 

as accurately as required, even on • We denote this approxi- 

mation by U^. 

This method was developed and efficiently implemented by Tiirke in [12]. 
The symmetry and definiteness of the matrix H can be controlled adap- 
tively. The algorithm was parallelized and implemented for 3D linear elastic 
problems, see [2]. 

For the definitions and the properties of the block matrices in (2.47) see 
[11]. The stability and convergence analysis of the discretized problem is 
described in [5]. 

2.4 The numerical solution of the local Neumann problems 

In the parallel algorithm we use the collocation method to obtain the dis- 
cretized Poincare-Steklov operator, because the generation of the BE ma- 
trices Ai and Bi is ’cheaper’ with the collocation technique, than with the 
Galerkin-scheme. In order to solve the local Neumann problems numerically, 
after generating the BE-matrices Ai and Bi, we have to compute the ma- 
trix Li (see equation (2.51)). This involves the invertation of matrix Ai. The 
solution is increasingly sensitive to perturbations (e.g. roundoff errors). We 
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apply the singular value decomposition (SVD), that is a useful tool to handle 
sensitive linear systems. The flop counts associated with the singular value 
decomposition algorithm is 4mn^+8n^, which is quite large. The advantage of 
the method is however, that it is more accurate than other techniques, which 
produce comparably inaccurate results when applied to large, ill-conditioned 
problems. 

Now we are interested in highly parallelizable and efficient methods for 
solving the system (2.48) of the coupled FE/BE equations. 



2.5 Solution techniques 

After eliminating Uy one can apply different parallel solution techniques to 
solve the block system (2.48) of linear equations: 

1. Bramble/Pasciak’s CG: 

Multiplying in (2.48) the first three block equations with (-1), we get 
the following symmetric, but indefinite system: 



Ka -Kac O 
-Kca -Kc -Kci 
O -Kic -Ki 




(2.58) 



with 



Ka = H, Kac = - 



Kc = 



^CA 

F fCb 
^CbCf ^Cb 



^)i /^ — Me — (MCf> MCb) 



, Kci = KJe 




or in simplified form: 



f Ki Kn \{ Hi L 

V K21 -K 2 )\H 2 J V 1 



(2.59) 



with the following definitions; 



Ki=Ka = H, Ku = = {-Kac 



Ko = 



Kc 

Kic 



Kci 

Ki 



/i=0, 



The (symmetric) coupled FE/BE-algorithm by Danger leads to a block 
system of the same form, cf. [8]. One can apply Bramble and Pasciak’s 
method to transform (2.59) into an equivalent system of linear equa- 
tions with a positive definite and self-adjoint^ system matrix, cf. [1]. In 
the transformation one needs an easily invertible, positive definite and 
symmetric matrix C \ , that is spectrally equivalent to the matrix K \ : 



^ with respect to a specially chosen inner product 
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tCi < J^i <7(7 i, (2.60) 

with positive spectral equivalence constants 7 and 7. Provided that be- 
sides Cl a preconditioner C2 for the matrix K 12 \s known 

(e.g. by means of the Additive Schwarz Method, ASM), the Bramble/ 
Pasciak’s conjugate gradient method can efficiently be applied to solve 
the transformed system. 

2. Iterative substructuring 

Another approach obtaining a positive definite and symmetric system 
from (2.48) consists in the elimination of the unknowns Uj and We 
obtain the full Schur complement system for the unknowns Uq\ 

( )(:::)=( I: ) 

with 

^Cf ^Cf ^CfI ^ I ^ICf^ 

ScfCb = J^CfCb - J^ICb^ 

ScbCf — ScfCbi 

Scb = ^Cb - KcbiKJ^Kicb + KcaK'X^Kac, (2.62) 

The matrix in (2.61) is symmetric and positive definite, thus the precon- 
ditioned conjugate gradient (PCG) method can be applied to solve the 
system. As preconditioning one can use the wirebasket-based algorithms, 
that are constructed for three dimensional FE-DD problems, cf. [10]. 

In (2.62) there are inverse block entries, namely KJ^ and . Therefore 
it is a good strategy to apply Bramble/Pasciak's CG for the system (2.59) 
in the case of large Kj and Ka matrices (e.g. there are only few processors 
with large memory), whereas for many substructures with few unknowns 
the Schur complement system (2.61) can be solved by means of the PCG. 
In the conjugate gradient method the system matrix doesn’t have to 
be available explicitly, but only in form of matrix-vector product. It is 
therefore sufficient to factorize the matrices Kj and Ka^ and to apply 
the factorized matrices to the right hand side in each CG-step. 

These algorithms are well suited for computations on MIMD computers 
with local memory and message passing communication principle. The 
h3rpercube seems to be the most suitable processor topology for both 
iterative solution techniques, cf. [8]. 

3. Recursive substructuring with static condensation 

If we use a direct method to solve the positive definite and symmetric 
system (2.61) of linear equations, we can’t avoid the expensive explicit 




Parallel Coupling of FEM and BEM for 3D Elasticity Problems 255 



computation of the inverse matrices and The advantage of 
the method however is in the recursion: With recursive organization of 
the computations can be achieved, that one has to solve systems with 
few unknowns at a time. Because of the recursion, the tree is the most 
suitable architecture for the communication of the processors. 

We apply the third solution strategy, to solve the coupled FE/BE-system 
(2.47) of linear equations. In the next section, the decomposition of i?F is 
presented. Then, in Section 4 we define the recursive algorithm and in Section 
5 we show some numerical results. 



3 Decomposition of the FE mesh in substructures 

In the recursive algorithm we need an automatic mesh decomposition al- 
gorithm, to achieve the FE subdomains from the original FE mesh. In our 
algorithm we implemented the automatic domain decomposer technique de- 
veloped by Farhat, see [4]. The benefits of this method are, among others, as 
follows: 

- The algorithm is suitable for two as well as three dimensional meshes and 
for arbitrary discretizations. It is able to handle eventually irregularities in 
the geometry as well. 

- Each subdomain has equal number (rieiem) of finite elements. In the case, 
that the number of finite elements Ne cannot be divided by the number of 
subdomains q without a rest, one subdomain gets less elements: {fieiem = 
Ne — [q — \)neiem)^ while the other q-\ domains consist of Ueiem = [^] H- 1 
elements. Therefore it yields a set of balanced subdomains. 

- The number of interface nodes is kept low in order to reduce the cost of 
synchronization and/or message passing between the processors. 

- It is easy to implement the algorithm into standard commercial FE pre- 
processors. The technique is simple and the storage requirement is low. 



3.1 Examples 

We have tested the above algorithm on some three dimensional finite el- 
ement meshes. For the discretization we have used tetrahedron elements 
with 10 finite element nodes and each FE mesh has been decomposed into 8 
subdomains. On the figures the thick lines denote the coupling boundaries. 
The number of elements {ueiem) stnd the number of FE nodes (interior and 
coupling: rinode —nj-^ nc) in the subdomains are summerized in a table. 
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Fig. 3.1. Decomposition of a cube and Leon’s cube into 8 FE subdomains 



1. Example: Cube (750 Elements, 1331 Nodes) 



Table 3.1. Distribution of finite elements and nodes of the subdomains (Ex. 1) 



No. 


Tlelem 


"Tlnode 


ni 


nc 


No. 




"f^node 


ni 


nc 


1 




223 


135 


88 


5 


94 


238 




MM 


2 


mm 


237 




127 


6 


94 


233 


Bi 


119 


3 


94 


244 


111 


133 




94 


238 


Wm 


IP 


4 


94 


236 


68 


168 




92 


272 


m 


|gi 



To outline the facts about the load balancing, we summerize some statistics 
(the minimum and maximum values, the averages x and the scatterings s) in 
another table. Here the following notations are used: 




s = ^— V (a;i - x)^ (n = 8). 
n — 1 

i=l 



Table 3.2. Statistics about the load balancing among the subdomains (Ex. 1) 





min 


max 


X 


s 


nelem 


92 


94 


93.7 


0.7 


nnode 


223 


272 


240.1 


14.2 


ni 


68 


135 


105.7 


19.1 


nc 


88 


168 


134.4 


23.6 
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Each subdomain (excepting the last one) has the same number of tetra- 
hedron elements, thus the scattering of the element number is of course low. 
There are greater scattering values in the distribution of the interior nodes 
(/) and the coupling nodes (C). 

2. Example: LEON’s problem (1356 Elements, 2179 Nodes) 



Table 3.3. Distribution of finite elements and nodes of the subdomains (Ex, 2) 



No. 




"f^node 


ni 


nc 1 


No. 


'f^elem 


"f^node 


ni 


nc 


1 






252 


ItfiKMl 


5 






191 


175 


2 




378 


161 


217 


6 


■1 




169 


189 


3 




356 


188 


168 




ni 


384 


151 


233 


4 




360 




138 




wsm 


372 


228 


144 



The statistics are similar to the first example. The scattering of the elements 
is again less than that of the nodes with one order of magnitude. A good 
load-balancing between the processors requires low scattering values of the 
relevant data. For example, in the generation of the stiffness matrices, the 
number of finite elements {rieiem) is relevant. When computing the FE Schur 
complement matrices Scf (see Equation (2.62)), the number of interior and 
coupling nodes are relevant, since the dimensions of the matrices Kj and Kc 
are 3 n/ and 3 nc, respectively. 

Table 3.4. Statistics about the load balancing among the subdomains (Ex. 2) 







max 


X 


s 


nelem 


166 


170 


169.5 


1.4 


nnode 


356 


384 


366.7 


10.2 


ni 


151 


252 


195.2 


35.6 


nc 




233 


171.5 


41.6 



4 The recursive substructuring algorithm 

4.1 Basic idea 

In this section we introduce an algorithm to solve the full Schur complement 
system (2.61) parallel, recursive, based on direct factorization. The basic 
idea of the recursion is as follows: After the unknowns corresponding to the 
interior nodes in the FE-subdomain have been eliminated (with building 
the inverse of the matrix K/J, we get a “stiffness” matrix Sci (the FE Schur 
complement matrix) corresponding to the coupling nodes of the subdomain: 

SCiUCi~Lci^ 



with 
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Sc. = Kc, - Kcu Kl^ Kic, , - Kcu , Kcu = KJc, • 

We can treat the substructure as one large finite element with the stiff- 
ness matrix Sci-> and these large elements can again be summarized into 
subdomains. The unknowns corresponding to the interior nodes of these new 
subdomains are eliminated with building the FE Schur complement matrix 
related to the coupling nodes of the larger subdomains. This technique can 
be further applied recursively. 

Parallel to the FE-substructuring one local Neumann boundary value 
problem has to be solved in each BE subdomain (elimination of Uy, see 
Equation (2.49)). After that, one has to compute the BE Schur complement 
matrices 5^. (elimination of u^): 

5b. = KcmK^IKac,. (4.2) 

In the implementation we use the following communication structure: 

The FE Schur complement matrices are computed in tree topology (with 
degree D and height i/), whereas the BE substructure Qi in a hypercube of 
dimension Ni, see Figure 4.1: 




Fig. 4.1. Communication structure of the recursive algorithm 



4.2 The parallel algorithm 

In the next algorithm we use the following notations: 

Notation: Meaning: 

Routine [i,p] — > The Routine runs on processor 

(i,p) of the tree topology. 

(for i = k\ \ s \ k2) — > Loop with the variable i, that 

runs from kl to k2 with step s. 

(par j = kl : s : k2): — > The Routine runs parallel on 

Routine [mj] the processors [mj] {j = kl : 

s : k2). 
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It is supposed in the following algorithm, that in the preprocessing the 
continuum under consideration is discretized into conform finite elements, 
and the finite elements and nodes within the BE substructures are deleted. 
This finest FE discretization is denoted with the indices [0,0]. The boundary 
conditions are also prescribed and stored together with the geometry data 
and material constants on the hard disk of the host computer. 

Now we present the recursive algorithm: 



— Input [0, 0]: Reading the mesh data and boundary conditions for the whole 
problem from the host computer. 

— SendB [0,0]: The root of the tree topology sends the mesh data (the 
geometry data of the appropriate coupling triangles) and the boundary 
conditions to the processors with the id 0 of the hypercubes, see the broken 
lines on Figure 4.1. 

— FE substructures: 

1. {fori = 0:l:H - 1): 

— (par k = 0 : 1 : — 1) Decomp [i, k]: Decomposition of the FE mesh 

[i, k] into D substructures: 

FE mesh [z 4- 1, j], {j = kD : 1 : (A: -f- 1)D — 1). Here we implemented 
the mesh decomposition algorithm of Farhat [4], see Section 3 

— (par A: = 0 : 1 : D* — 1) SendF [f. A:]: After the decomposition we send 
the mesh data of the j-th subdomain (FE mesh [^ -f 1, kD 4- j]) to the 
j-th son (processor {i 4- 1, A;D 4- j)) {j = 0 : I : D - 1). 

— (par A: = 0 : 1 : — 1) RecvF [i 4- 1, A:]: The sons receive the mesh 

data fi'om their father. 

2. (par i = 0 : 1 : — 1) MatGenF [H,i]: Generating the finite element 

stiffness matrices on the level H and assembling them in the subdomain 
stiffness matrix Ki. Here, the local numbering of the nodes is taken into 
account, i.e. the interior nodes come first, then the coupling nodes: 









The nodal load vector is also generated: 



_ ( 



I / 



(H,i) 



After that, the Dirichlet and Neumann boundary conditions are built in 
the matrix and the right-hand side. 

3. = 

— (par A: = 0 : 1 : D* — 1) SchurF [i,k]: Elimination of the unknowns 
in the interior nodes, that were not eliminated yet, with building the 
Schur complement matrix explicitly: 
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V ® \ 24C(i,fc) 



with 




fi =f -Kci.,,.K7\J. . 



- (par A: = 0 : 1 : D* — 1) SendFSchur [i,k]: Sending the Schur com- 
plement matrix Sc, ^Rd the right-hand side to the father 

processor. 

— (par fc = 0 : 1 : — 1) RecvFSchur [i — 1, A:]: The nodes receive 

the Schur complement matrices from their sons and assemble their own 
stiffness matrix The local numbering is taken into account: 









Here I denotes the unknowns in the interior nodes of the FE subdo- 
main [i — 1, A:], that were not eliminated yet. 

4. RecvBSchur [0,0]: The root processor of the tree receives the Schur 
complement matrices Sq^ of the BE subdomains: 

Scf = KcAiK^lKACi, i = 0,...,p-g-l, 



where p — is the number of the BE subdomains. These matrices will 
then be assembled in the FE stiffness matrix /^(o,o)- 
5. SchurF [0,0]: As the last step of the algorithm, we solve the Schur 
complement system concerning the matrix A"(o,o) • 

The eliminated unknowns can now be computed recursively with 

backward substituting the already computed vectors Uq^. .y 
— BE substructures: (par i = 0:l:p — gr — 1) 

The BE Schur complement matrix is computed parallel, in a hyper- 
cube of the dimension N{. 

1. RecvB [i]: The processors with the id 0 of the A:-th hypercube receive 
the geometry data of the coupling triangles of the A;-th BE subdomain 
and the boundary conditions. 

2. MatGenB [i] and SolveB [i] : Applying the collocation method, the 
BE matrices A{ and Bi are generated. Then the nodal displacement 
vector Uy. is eliminated, i.e. we solve a Neumann problem on each BE 
subdomain, see the discretized boundary integral equation (2.27) and 
the Poincare— Steklov operator (2.30). As a result, we achieve the matrix 
Li (the discretized form of the Poincare-Steklov operator) explicitly: 

Li = A~^Bi = A~^BiRBi- 
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To compute the matrix L{ we use the singular value decompisiton, as 
mentioned in Section 2 

3. SchurB [i]: In this routine we first compute the matrix Hi, see Equation 
(2.49): 

Hi = SiLi {=SiAr^Bi). 

Then we eliminate the nodal traction vector i.e. compute the BE 
Schur complement matrix see Equations (2.58) and (2.62): 

SE^ = KcmK^^Kac,. 

4. SendBSchur [i]: The processor with the id 0 of the hypercube i sends 
the matrix to the root processor of the tree topology. 



5 Implementation and numerical results 

5.1 Implementation on a distributed memory MIMD computer 

The algorithm was implemented on a Multicluster 3DE Transputersystem 
of the firm Parsytec with 32 INMOS T805 Transputers (clock frequency: 30 
MHz, 4 MB memory on each node, altogether 128 MB). The host computer 
with all the peripheral devices is a SUN SparcStation 10/30 with a connection 
board for the parallel computer. The developments of the parallel algorithms 
were made with an ANSI compatible C cross compiler for the runtime system 
PARIX using the versions 1.0 to 1.2 for T805 transputers. In Figure 5.1 two 
three dimensional hypercubes (for 2 BE subdomains) and one binary tree 
{D = 2) with the height H = 3 (for 8 FE domains) are shown. This virtual 
topology is used in the example in the same figure. 




Fig- 5-1. DecompositioTi of a plate with two holes into 8 FE and 2 BE sobdomains 
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The FE mesh consists of 480 elements (quadratical tetrahedra) and 1111 
nodes, and there are 108 collocation points (constant triangles) per BE sub- 
structure. The minimal (Min), maximal (Max), average (T) and strewing (S) 
values of the CPU-times needed for the computation of the FE and BE Schur 
complements are listed below: 



Table 5.1. CPU-times of computing the FE Schur complements 



Routine 


Level 


' f^proc 








mi 


MatGenF 


3 


8 


10.79 


14.67 






SchurF 


3 


8 


126.17 




395.17 




SchurF 


2 


4 


10.46 




21.52 




SchurF 


1 


2 


9.56 




9.89 




SchurF 


0 


1 


15.82 


15.82 


15.82 


0.00 



Table 5.2. CPU-times of computing the BE Schur complements 



Routine 


HCube 


flproc 






mi 


Bll 


MatGenB 


0 


8 






22.03 


2.58 


MatGenB 


1 


8 


18.14 


23.98 




2.43 


SolveB 


0 


1 


197.82 


197.82 


197.82 


0.00 


SolveB 


1 


1 


197.82 


197.82 


197.82 


0.00 


SchurB 


0 


1 


38.12 


38.12 


38.12 


0.00 


SchurB 


1 


1 




38.10 




0.00 



As expected, the explicit computation of the FE Schur complements on 
the lowest tree level requires the greatest computational cost. 

In the routine MatGenB the BE matrices are generated in hypercube 
topology with row wise distribution. The Neumann problem is solved in the 
routine SolveB. The low scattering values in the parallel generation of the 
BE matrices mean a good load balancing between the hypercube processors. 
The most expensive part of the computations in the BE subdomains is (as 
expected) the SVD algorithm in the routine Solve. The great benefit of SVD 
is however, that the matrices Hi are positive definite, which was not the case 
when applying other solution algorithms. 



6 Conclusions 

In this paper a recursive substructuring algorithm for solving the linear sys- 
tem of equations arising from a coupled FE/BE discretization of three dimen- 
sional linear elastostatic problems on distributed memory parallel computers 
is presented. The elastic domain under consideration is decomposed into non- 
overlapping FE and BE subdomains. The proposed algorithm is well suited 
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for stress concentration problems: The critical regions with high stress con- 
centrations are computed with the boundary element method, while the rest 
part is approximated with finite elements. 

In the FE region, the mesh decomposition algorithm of Farhat is used 
to create the FE subdomains. The decomposition is organized recursively, in 
tree processor topology. The root processor of the tree decomposes the mesh 
into two subdomains and sends the meshes to his sons. The sons further de- 
compose their meshes and so on. One benefit of this recursive implementation 
of Farhat ’s mesh decomposition is that it results in compact, connected sub- 
domains. (In the original algorithm of Farhat it is not guaranteed.) Another 
advantage of the recursion is, that only small systems of linear equations have 
to be solved on one level of the tree. Further, the mesh decomposition can 
be applied to quite arbitrary three dimensional domains. A drawback of the 
recursive substructuring algorithm is however, that because a direct equation 
solver is used, the Schur complement matrices have to be computed explicitly. 
Iterative solvers require the matrix usually only in form of a matrix-vector 
multiplication, so after factorization, the backward substitution doesn’t have 
to be performed. 

In the BE subdomains one has to solve local Neumann boundary value 
problems in order to compute the BE Schur complement matrices. Since 
the matrix of the linear system is close to singular and the solution is very 
sensitive to small perturbations, special technique has to be applied. The nu- 
merical tests show, that SVD is able to perform the theoretically guaranteed 
symmetry and positive definiteness of the matrix H. The drawback of the 
method however is the large number of fioating point operations. 

The algorithm was implemented on a parallel computer with message 
passing principle. With a suitable choice of the dimensions of the topologies 
the proposed domain decomposition algorithm can be applied to solve the 
mixed domain and boundary integral variational problem efficiently. 
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Coupling of BEM and FEM for Elastic 
Structures 

E. Schnack and K. Tiirke 



1 Introduction 

In this paper a numerical method for the solution of elliptic boundary value 
problems is presented. As an application we will analyze problems in linear 
elasticity. Especially we want to resolve zones with high stress gradients, for 
example the surrounding of notches in a loaded machine part. The stress 
peak may occur the failure of the whole structure. 

In the field of numerical simulation the Finite Element Method (FEM) 
is the standard tool and already well established for industrial applications. 
Another approach is the Boundary Element Method (BEM) which offers the 
possibility to reduce the effort of discretization in comparison with the FEM 
by one dimension. In addition the BEM is able to approximate very accurately 
high stress concentrations in compact domains with a relative low number 
of degrees of fi:eedoms. This leads us to the suggestion to combine the two 
methods FEM and BEM. Of special interest is the so called macro-element 
technique where the BEM substructures are interpreted as special finite el- 
ements. This strategy allows the implementation of the coupling method as 
an extension modul in existing FEM software. 

The algorithm of this paper follows this stategy. A special characteristic 
of our coupling algorithm which has been developed by the research group 
Schnack (see literatur in Section 2) is the non-conformity of the discretiza- 
tions. The introduction of a hybrid ansatz for the displacement and the trac- 
tion fields on the surfaces of the macro-elements and the formulation of vari- 
ational transmission conditions gives the possibility to use an independent 
discretization for each subdomain. Besides the larger flexibility concerning 
the mesh generation, this strategy is important for the parallelization of the 
algorithm. Here the stiffness matrices of the macro-elements and the FEM 
subdomains can be generated completely independent on different proces- 
sors. A communication between the processors is only necessary during the 
solution of the global system. 

The properties of the coupling matrix like symmetry and positive defi- 
niteness are controlled numerically within an adaptive scheme. 
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2 Basic system and continuous formulation 

2.1 Model problem 

We are analyzing interior domain problems in an isotropic domain fi with a 
Lipschitz boundary F in , m = 2, 3, as shown in Fig. 2.1. In the next step 




we define a non overlapping domain decomposition of i? in a basic domain 
and several substructures fii with i = 2, . . . with 

i? c niDnj = ^ forz^j, r = dn = rtUFu (2.1) 

and 

Ti n r = 0 and A = TU ( ^i) V i > 2. (2.2) 

For the sake of simplicity we restrict ourself now to the case p = 2 and define 

Fc = dOi n ai ?2 (2.3) 

as the coupling interface (see Fig. 2.1). We introduce the following notation: 
Values on Fc coming from 1?2 will be denoted with a symbol and values 
coming from i?i with a symbol. 

As a boundary value problem in the given domain i? we have the Lame 
system of linear elasticity with mixed boundary conditions. This leads us to 
the Navier equation 

Z\*u = 0 for ^ G i? (2.4) 

as the describing differential equation for the displacement field u G [H^ (f?)]”l 
A* denotes an elliptic partial differential operator. As boundary conditions 
we have given displacements on the Dirichlethoundeiry F^ and given tractions 
on the Neumann boundary Ff. 

tx(^) = u(^) for ^ G - r \ (2.5) 

for ^ G Ft. 



m = nu{o^m 



( 2 . 6 ) 
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n denotes the traction operator. 

As a result of the domain decomposition we have to fulfill additionally 
the kinematic and static transmission conditions 



U+iO = 


(2.7) 






t+iO=t-iO. 


(2.8) 



2.2 Set of coupling equations 

The base of our coupling method is given by a hybrid stress method, for- 
mulated 1973 by Schnack [25] and Tong, Pian and Lasry [33], which realizes 
a mixed FE method by the transformation of domain integrals in boundary 
integrals. Starting with a modified Reissner functional and by formulating 
independent trial functions for the kinematic and the static group one re- 
sults in compatibility and equilibrium conditions in a variational form. The 
stiffness matrices of the hybrid elements can be generated analytically, if the 
problem is restricted to special element geometries and to a small number of 
degrees of freedom in each element. 

1983 Schnack [26] realized a generalization of this method by incorporat- 
ing the boundary integral equation in the procedure. This extension allows 
the definition of arbitrary shaped ’’macro-elements” with an unrestricted 
number of degrees of ft*eedom. In the following years further developments 
and applications on two and three dimensional problems in elasticity were 
published by Schnack [27], [28], Carmine [1] and Karaosmanoglu [14]. A sta- 
bility and convergence analysis for the coupling method is done by Hsiao, 
Schnack and Wendland [13]. Comparisons with other coupling strategies can 
be found in the papers of Hsiao [11], [12] and Polizzotto and Zito [22]. A com- 
parable strategy to our coupling method was applied for example by Grannell 
[7], who also uses hybrid elements for coupling FEM and BEM. 

In the following we will define the four fundamental equations for the 
coupling algorithm on the continuous level. 

1. The first basic equation is the variational formulation of the given bound- 
ary value problem (2. 4), (2. 5), (2. 6), split up in two parts for the subdo- 
mains i?i and f 22 '. 

aiiuuvi) + a 2 {u 2 ,vi) - f{vi) V € [//o(^)]™> (2-9) 

where a denotes the positive definite and symmetric energy bilinear form 

ai{u,u) = j (Jij{u) eij(u) dfi (2.10) 

Qi 

with the stress tensor cr and the strain tensor e. The test function v\ 
is defined on the whole domain i7. The right hand side is a boundary 
integral on the Neumann boundary Ft 
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/(vi) = j ^^vidr. 

Ft 



( 2 . 11 ) 



Eq. (2.9) is the well known energy functional also used in the classical 
FEM formulation and represents a weak equilibrium condition. 

2. Additionally we need a second bilinear form to realize the nonconformity 
of the grids on the interface Fc. We define 

a2(V2,U2) = g{V2) V W2 € [ffo('^2)]'" (2.12) 

with the test functions V 2 only defined on the subdomain i? 2 - The right 
hand side is once again a boundary integral now defined on the interface 
Fc. With the notation 

J ydF =< X ,y > (2.13) 

we have 

g{v 2 ) = < , n_ > with = n[v 2 \. (2.14) 

3. As a side condition we require that the static equilibrium 

V (2.15) 



holds in each point of the subdomain i? 2 - 

4. The last basic equation results from the Calderon projector, introduced 
by Costabel and Wendland [4] 



f tx+(0 

V 4(0 




^I-K 

D 



V 







(2.16) 

In Eq. (2.16) V denotes the weak singular integral operator resulting from 
Kelvins fundamental solution, K and K* are singular integral operators 
and is a hypersingular integral operator. For our coupling method we 
can use an arbitrary projection of the Neumann data in the Dirichlet 
data on Fc. So we choose the singular formulation of the Poincare- 
Steklov operator resulting from the the Calderon projector (2.16) 



= 77 ^t-(- = 






-1 



(2.17) 



With this set of equations we can formulate the basic relationships of the 
coupling method. By applying the 1. Greens formula on the bilinear form 02 
we get 



a2{u2,vi) = J aij{u2)eij{vi)dn =< > - J aij^i{u2) vi. df2. 



(2.18) 
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Under consideration of the side condition (2.15) the last domain integral in 
Eq. (2.18) disappears and (2.9) can be transformed to 



ai(ui,Vi)-f < t+,v_ > = f{vi). (2.19) 

In the same way the second coupling equation (2.12) is equal to the interface 
integral 

< , (u+ - U-) > = 0. (2.20) 

The displacement field on Fc can be replaced by the Poincare-Steklov 
operator (2.17), so that we finally receive the coupling equation 



< U 



( 



-I + K 



V 



t+ > - < t+ , U- > — 0, 



( 2 . 21 ) 



that defines a variational compatibility condition on Fc consisting of positive 
definite and symmetric energy terms. Prom (2.21) we can calculate the trac- 
tion field in dependance of U- and insert the result in (2.19), so that finally 
there remains a variational form depending only on test and trial functions 
for the diplacements in the basic domain and on the interface Fc. 



3 Discrete formulation 

In this section we will derive an equivalent discrete formulation of the con- 
tinuous coupling equations (2.19) and (2.21) for the implementation in a 
numerical algorithm. In Fig. 3.1 you can see in the upper part an example 
of a decomposed structure fi in 2D with a coarse grid H which consists of 
triangular finite elements in i?i and of line elements on the boundary of 172 ■ 
On the right hand side the subdomain is depicted again with the coarse grid 
H and additionally with an independent fine grid h. 

The configuration on the interface Fc for the 3D case is depicted in the lower 
part of Fig. 3.1. Here the domain discretization H in I7i is realized with 
tetrahedra which generate coupling triangles on Fc. The fine grid consists of 
triangular boundary elements, in Fig. 3.1 we have nine boundary elements 
for one coupling triangle. 

Now we realize the spaces for the trial and test functions. We start with 
the space for the displacement field uh on the interface Fc 

U := {Hi/2(re)}. (3.1) 

The displacement field is approximated by the spline formulation 

Uh := {uh £U \ uh ^ S^{AH)}. (3.2) 

53 denotes the space of continuous spline functions of the order d. In an 
explicit form this leads to the piecewise polynomial expansion 
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FEW node boundary dement 




Fig. 3.1. Discretization in 2D and in 3D 



uh = NcOLc on Fc. (3.3) 

Because of the hybrid formulation we have to define an independent space 
for the traction field on Fc by 

W := < t , I >=0, < r X t , / >= 0} (3.4) 

with the unit operator I. The two side conditions guarantee that the traction 
formulation leads to a stress field in i ?2 that fulfills the condition (2.15) for 
the static equilibrium. An explicit formulation of these side conditions will 
be derived in Section 4. 

On the approximation level we have 

Wh := Sjl,{AH)}. 



(3.5) 
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^d-i denotes the spline space with a reduced order by one because the trac- 
tions are computed with the derivatives of the displacements. Additionally 
we use so called double nodes for the formulation of the traction ansatz. This 
leads to a discontinuous spline space. The double nodes are depicted in the 
right part of Fig. 3.1 by the white dots at the corner points. The explicit 
expansion is given by 

in = Rh ^ on Tc, (3.6) 

where the parameters /3 are independent of the parameters etc- With Eq. (3.3) 
and (3.6) we have realized the hybrid ansatz on the coarse grid H. In order 
to work on the fine grid h, we have to realize a projection of the coarse grid 
trial function in to th- If we use the same spline space Wh on the fine grid 
level, we obtain a transformation 

th = CtH, (3.7) 

with a constant projection operator C containing the coordinates of the fine 
grid elements h on the coarse grid elements //. As a result we obtain on the 
fine grid level the trial functions 

th = Rhfi (3.8) 

with 

Rh = C Rh- (3.9) 

We need the corresponding displacements Uh depending on the Neumann 
data th- This is realized by the initial formulation 

Uh = Nh^ on Tc. (3.10) 

Uh belongs to the same spline space Uh (see Eq. (3.2)) as the displacements 
uh on the coarse grid, the parameter groups /3 and 7 are connected by the 
Poincare-Steklov operator (2.17) 

Uh= (1^ + ^) V th. (3.11) 

On the discrete level this leads to 

= (3.12) 

This set of trial and test functions can now be used to fulfill the coupling 
equations (2.19) and (2.20) on the discrete level. This leads to the equivalent 
formulation 

ai{UH,VH)+ <tH,VH > = f{VH) (3.13) 

and 

< tn , Uh > - < th,Uh > = 0. (3.14) 
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Inserting (3. 3), (3. 6), (3. 7) and (3.10) in (3.13) and (3.14) under consideration 
of (3.12) we finally get the following block structure system: 




OLi are the parameters of the displacement ansatz similar to (3.3) in the in- 
terior of the b 2 tsic domain f}\ . The matrices K are the usual global stiffness 
matrices of the finite elements in and on Fc resulting fi:om the discretiza- 
tion of the bilinear form ai{u\,V\). The right hand side of (3.15) contains 
the load vector F, given by the discretization of the boundary integral f{v\) 
on the Neumann boundary Ft. 

The coupling equation (3.14) generates the integral operators 

S = <Rh,Nh> (3.16) 

and 

T = < Ah, Ne > . (3.17) 

Under consideration of the solution of the Neumann problem (3.12) the op- 
erator H is defined by 

H = SA~^B. (3.18) 

H is the interface coupling matrix which corresponds to the symmetric energy 
bilinear form on the continuous level 

a2{U2,U2) = J (Tij{U2) eij{U2) df] =< > . (3.19) 

On the discrete level we have the equivalence 

<th,Uh>-^0^ H^. (3.20) 

The symmetry and definiteness of H depends on the numerical accuracy for 
the computation of the Poincare-Steklov operator (3.11). In Section 6 we will 
develop an iterative procedure to control this accuracy. With a sufficient small 
numerical error we obtain the symmetric, but indefinite system (3.15). This 
structure can be transformed into a positive definite system by eliminating 
the parameter l3 with the Schur complement, so that we get with 

Kec = K,c + T^ H~^T (3.21) 

a symmetric and positive definite global stiffness relation 



JKii Kjc OLi _ F' i 

R- ci R- cc F c 



(3.22) 



System (3.22) can be solved by standard direct or iterative solvers for FE 
systems. 
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4 Trial functions for the traction field 

In this section we will derive an explicit form of the side conditions for the 
space W of the traction field on Fc (see Eq. (3.4)) for the 2D case. 

Let us consider a piecewise smooth boundary Fc of the subdomain i ?2 with 
nCc smooth subboundaries generated by the coarse grid elements H. In each 
element I {I < I < nec) we define 

(4.1) 

with a polynomial interpolation matrix Af and the free parameters . If 
we use polynoms of second degree for the interpolation we receive 

(4.2) 

and 

= 

with 

flf=[^C(C-l) 1-C^ ^C(C + 1)] = ["m(0 "^2(0 msiO]- (4.4) 

The interpolation functions are defined in a reference element C ^ [“I?!]- 
The superposition 

nCc nec 

= (4.5) 

1=1 1=1 

leads to double nodes at the corners between two elements as shown in the 
right part of Fig. 3.1. 

To ensure that the static equilibrium (2.15) and Cauchys symmetry condition 

aij = Oji (4.6) 

are exactly fulfilled in i ?2 we have to modify the polynomial ansatz (4.5). 
In the first step we formulate global side conditions, which are sufficient, if 
we have a completely smooth boundary Fc without double nodes. Condition 
(2.15) leads for the trace of the stress tensor to an equilibrium of forces 

nec n 

<t,/> = 0 or 'Y^ltfLidr = Q , i = l,2, (4.7) 

j=i/^ 

where Li is the length of the element 1. Prom the symmetry condition (4.6) 
we get on the boundary an equilibrium condition of momentums 

nec p 

<txr,I>^0 or ^ / (xf ^ i'l dr = 0. (4.8) 

1=1 
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Additionally we have to take under consideration local side conditions if we 
have a boundary Fc with corners that cause the generation of double nodes 
(see point Pq in Fig. 4.1). The corner is characterized by a jump of the normal 
unit vector n. With the definition of the traction vector 



— ^km 



(4.9) 



we can formulate the components of the stress tensor in Pq depending on the 
trace vectors and 



(T\k = ^ (^2^^ tfc - tf ) , 


(4.10) 


0-2A = ^ tf - nf ^ tf ) ; A: = 1,2. 


(4.11) 


In (4.10) and (4.11) the abbreviation 




d = 


(4.12) 



is used. By inserting (4.10) in the divergence theorem (2.15) we receive two 
equilibrium conditions 



( 2 ) 



dt^ 



,(i) 



dtf 



dtf 



dt^ 



+ = o . * = 1.2. 



dxi dx\ ' 5x2 5x2 

The symmetry condition (4.6) delivers one more local side condition 



(4.13) 






t2 — tf — tf. 



(4.14) 



Chaudonneret [2] has also introduced compatibility equations for the traction 
formulation at corner points. In the formulation of Chaudonneret the stress 
tensor is required to be symmetric and additionally the volume dilatation has 
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to be constant as a kinematic side condition. In comparison with Chaudon- 
neret the equations (4.13) and (4.14) define compatibility conditions only 
depending on the traction field. 

In summary we have to modify the polynomial traction ansatz (4.5) by 
fulfilling the global equilibrium conditions (4.7), (4.8) and additionally in each 
corner point the local conditions (4. 13), (4. 14). This leads to a decrease of the 
number of free parameters /3 ^ /3 according to the number of side conditions. 
We end up with the expansion 

tH = RHl3- (4.15) 

For details concerning the elimination of fi'ee parameters we refer to Carmine 

[!]• 

5 Calculation of boundary integrals 

For the numerical realization of the Poincare-Steklov operator (3.11) we have 
to calculate integrals of the type 

I K{U,x)ip{x)dr{x) (5.1) 

A 

with the i.g. regular shape function (^(x) and the strong or weak singular 
kernel function K(^^, x), depending if we have to integrate the operator K or 
V. Fig. 5.1 shows the different configurations in the two or three dimensional 
case. 



X 



2 ^ 




singular subboundary 
regular subboundaries k \ j 




[ 1 regular eubboundatiee K \ j 



Fig. 5.1. Geometrical approximation in and 



We restrict ourself to the case that the evaluation point is always 
located in the interior of a boundary element, so that we have no projection 
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of a singularity to neighbour elements. This can be easily realized by using 
the iterative solution scheme for the Poincare-Steklov operator descriped in 
Section 6. Another simplification is the use of only plaine elements, so that a 
curved surface has to be discretized by a sufficient number of plain elements. 

5.1 Singular integrals 

We give an overview on the methods used to solve integrals in the case ^ ^ € Fj , 
that means that the evaluation point is located inside the integration area. If 
the shape function (p(x) is a constant or a linear polynom the integrals can 
be solved analytically. In we have to solve line integrals. In the case of 
weak singular integrals Inr) we calculate the limit of an improper inte- 
gral, singular integrals r~^) can be easily handled by using the Cauchy 

principal value. In E^ analytical solutions of the weak and strong singular 
boundary integrals over triangular elements were derived by Cruse [5] and 
Lij Han, Mangy Torzick[\^]. 

In the case of using arbitrary regular shape functions ip{x) (for example if 
we deal with so called augmented elements) we also implemented a numer- 
ical version for the solution of singular integrals. In E^ the weak singular 
integrals can be solved by using a nonlinear coordinate transformation in- 
troduced by Telles [32]. The strong singular integrals are treated with the 
Clenshaw-Curtis algorithm {Clenshaw, Curtis [3], Davis, Rahinowitz [6]). In 
E^ the weak singular integrals over a triangle can be regularized by the trans- 
formation to polar coordinates. The remaining regular integral can be solved 
by a standard Gauss quadrature. For the strong singular integrals we also 
use polar coordinates and calculate the finite part of the integral in radial 
direction with the method of Kutt [17]. More general and efl&cient algorithms 
for this problem are published by Guiggiani, Gigante [8] and Kieser, Schwab, 
Wendland [15]. 

For details of the formulation and implementation we refer to Tiirke [34]. 

5.2 Regular integrals 

For the calculation of regular boundary integrals ^ Fj) we use a general 
numerical algorithm with an adaptive mesh refinement due to an estimate of 
the integration error. This strategy was introduced by Lachat and Watson 
[18]. Here we will explain the basic features for the integration over triangular 
boundary elements Fk. 

Applying the coordinate transformations depicted in Fig. 5.2 on F^ we receive 

ip{x) arm = J k{$i,x) drm = ( 5 . 2 ) 
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Fig. 5.2. TVansforraation of the boundary element F* 



Ni Ni y y ^ 

mi=\ m2=l_i 

The two line integrals over the unit square in the ()Lti,/X 2 ) system can be 
approximated by a two dimensional standard Gauss quadrature formula. If 
is the abbreviation for the whole kernel function in the last part of 
(5.2) we finally obtain the approximation 

N\ N 2 Tl2 Tl\ 

E E E (5.3) 

mi = l m2 = l A=1 /c=l 

To find appropriate values for ni,ri 2 ,iVi and we use the error estimate 
derived by Stroud and Secrest [30] 



+1 +1 



r r ^2 ni 

J J A{fj.i,n 2 )dnidn 2 - EE A{fjti (/c),/i 2 (A))w« wx 



-1 -1 



A=1 K=1 



with 






dUi 



2no 



<H, 



G 



p ) '^p 



22«p(2rip)!' 



< 2 E 

p=i 

(5.4) 

(5.5) 



G is an appropriate constant. Under consideration of some simplifications (cf 
[34]) we obtain for kernel functions with the characetristic (r“^) the estimate 
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/ L \ 

(2np + 1) <Cint ; P = 1, 2. (5.6) 

\ ^ f*min J 

Lp is the characteristic length of the actual subelement in fXp direction and 
dmin is the minimal Euklidian distance between the evaluation point and 
the actual subelement. For a free chosen upper bound of the admissible nu- 
merical integration error Cint formula (62) can be used to determine the 
required number of subelements N\ and and the number of Gauss points 
ni and n *2 in each subelement. 

Numerical tests (cf [34]) have shown that the estimate (5.6) is too pes- 
simistic. Schwab and Wendland have found in [29] that the error estimate 
(5.4) is only correct in r]i or /xi direction but has to be replaced by a deriva- 
tive free error estimate in rj 2 or 1 x 2 direction. This is due to the Duffy coor- 
dinate transformation J 2 (r/i,r/ 2 ) in rj 2 direction. Appropriate derivative free 
error estimates were introduced by Rabinowitz and Richter [23]. For the given 
problem we obtain a precise numerical integration scheme by the determi- 
nation of ni with formula (5.6) for a given bound Cint- The number ri 2 of 
Gauss points in r ]2 direction has to be chosen by 



ri2 > 



|ln/i| 
In 5 



ni. 



(5.7) 



Here h is one half of the element length (cf Fig. 5.2) of the integration element 
Fk in the real space (xiyX 2 ,xs) and J is a geometrical value depending on 
the tangential vectors x\i x \2 ^he corner point P 3 of F^ which has the 
smallest distance dmin to the evaluation point For details we refer to [29], 
[34]. 

This numerical tool is the base for a precise and efficient numerical treat- 
ment of the Poincare-Steklov operator. 



6 A Neumann series for elasticity problems 

In this chapter we will derive an iterative scheme for the numerical solu- 
tion of the singular boundary integral equation (2.17). The base for this 
method goes back to the classical Neumann series for the Laplace equation 
by C. Neumann [21] in 1877. In 1965 and 1969 W.L. Wendland [35] and 
W. Haack, W.L. Wendland [9] used the expansion of the Laplace equation in 
a Neumann series with singular potentials for the construction of a numerical 
algorithm. 

We start with the interior domain problem in the subdomain i ?2 C (cf 
Figure 2.1) with given Neumann data on the boundary Fc. For a consistent 
notation we change the index j?2 

As a result we want to obtain the solution of the differential equation 



zi*tx(O=0 V 



( 6 . 1 ) 
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Remark 1: In the following deriviations we use the work of Kupradze et.al. [16] 
who requires for the solution u{^) of Equation (6.1) the smoothness 

u£C^(n+)nC^iO+) (6.2) 

with i7-|_ = i? 4 . U Fc. For the boundary Fc Kupradze requires 

Fc Fli{a) , Q( > 0, (6-3) 

where 77i (0) is the class of smooth surfaces and 77i (a) , a > 0 is the class 
of the Liapunov surfaces. An extension of these requirements to Lipschitz 
surfaces was derived by Costabel [4]. Costabel uses the regularity results 
of Necas [20] for the Neumann and the Dirichlet problem. If we choose 
for the Neumann problem the tractions on Fc from the Sobolev space 
t_j_ € with a € [0, |], we obtain the displacement field on 

the boundary in ix € Because Bettis theorem remains valid 

for Lipschitz boundaries, we can apply the energy relations of Kupradze 
also on general Lipschitz boundaries. 

The solution of (6.1) can be obtained as follows: The Calderon projector 
(2.16) in the singular formulation gives a mapping of the boundary values on 

1 

u{^)^{-i-K)u{0 + vt+{^) V = (6.4) 

With (6.4) we can calculate the boundary displacements on Fc so that all 
boundary values are known. The displacement field in the interior domain 
can be calculated with Somiglianas formula. 

The solution u(^) in the interior domain i7_|_ can be extended to the exterior 
domain ^ E i?- with i?_ = E^\i7+ by Somiglianas formula because all 
integrals for radius i? — > oo disappear. This is due to the ass 3 maptotic behavior 
of the singular fundamental solutions. While the displacements are continuous 
at the interface Fc we obtain a discontinuity of the tractions t_|. t_ known 
as jump relation of the double layer potential. 

As a result we obtain a continuous displacement field u{^) in the full space 
^ U . By applying the stress operator on the displacement field 

tx(^) the stress field a{^) in the interior and the exterior domain is achieved 
as first derivatives of the displacement field. 

Definition 1: The linear function space X is defined as the space of elastic 
states that includes all stress and deformation states in the interior do- 
main i?_j_ and in the exterior domain i?_ caused by a loading of on 
Fc with an admissible traction field t_f_. Admissible traction vectors are 
charcterized by the requirement that leads to an equilibrium system 
in (cf Equation (2.15)). 

For the explicit formulation of we refer to Chapter 4. 
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Definition 2: We call the static field (t(^) and the kinematic field u{^) in the 
exterior and the interior domain the representatives of the elastic states 
and denote the elements of the space X in general with g £ X. 

If the elastic state is represented by the displacement field u(^), the solution is 
not unique due to the constant vector c, that includes the rigid body motions. 
In c consists of three translatoric and three rotatoric components. 

Definition 3: We define X' cts a subspace of X, that contains the vector c of 
the free constants: c G X'. Each element G X is associated with a class 
of elements 

[9] — {g + c:ce X'} (6.5) 

and is denoted as a linear manifold. As a result two linear manifolds are 
equal if 

9i-92^ 

The set of linear manifolds [g] also defines a linear solution space and is 
called the factor space 

X = x/X'. (6.6) 

o 

The solution u{^) is now defined in the factor space X. 

With a scalar product (g,g) and a norm 

IIpII = (6.7) 

o 

of the elements g of the space X the elastic states define a real Hilbert space, 

o 

since it can be shown (cf Rieder [24]) that the space X is a real, linear, closed 
and complete function space. 

Definition 4: For two displacement fields u{^) and u{^) with ^ G U i?_ 
the scalar product of the two associated elastic states g and g is defined 
by the positive semidefinite elastic energy in the interior and the exterior 
domain (cf Equation (2.10)) 

9) = ~ J dn J aij (u) 6 ij (u) dfi. 

fi— 

( 6 . 8 ) 

By applying the first Greens formula on (6.8) the domain integrals can be 
transformed to integrals on the interface Pc. We obtain 

(s,f) = j t+j{u)ujdr - j t-j(u)ujdr. (6.9) 



In the function space X we investigate the properties of the projection (6.4) 
with a given traction vector stud an unknown boundary displacement 

vector w(^). 
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The following spectral analysis of (6.4) refers to Kupradze et.al. [16] and 
Taylor [31]. For a consistent notation we define 

(Ij-if) =-y [T{dy,n)riy-x)]'...dyS. (6.10) 

5 

The formulation — J . ..dyS is used by Kupradze (cf p. 351, Equation (2.1), 

(2.2)). The displacement vector n(^) may be the representative of the elastic 
state. From (6.4) we receive the formulation 

g-XTg = Vt^ (6.11) 

with 

A = -fl (6.12) 

which is consistent to Kupradze (p. 351, Equation (I)). The associated eigen- 
value problem (cf Kupradze, p. 362, Equation. (4.4)) is defined by 





g - \Tg = 0. 


(6.13) 


With 


1 


(6.14) 


and 


g*=i^Vt+ 


(6.15) 


we obtain from (6.11) 


9 * = {f^-T)g 


(6.16) 



which is consistent to Taylor (p. 253). Since the function space X is a complete 
and closed space, the domain D{T) and the range W(T) of the operator T 

o o 

lies in X- We denote according to Taylor T C [X]- The solution g can be 
calculated with (6.16) by 

g = i/j. - T)~^ g* with fi-1. (6-17) 

The existence and uniqueness of the solution of this Neumann problem in 
is shown by Kupradze for the continuity conditions in Remark 1. Now we 
investigate, if (6.17) can also be obtained by the expansion in a Neumann 
series. This is possible, if a resolvent with 

oo 

(m - r)-' = Am = E ( 6 . 18 ) 

m=l 



o 



can be represented by a convergent series in the space X. 
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Definition 5: If the scalar /x in (6.16) is of the kind that the range of (/x - T) 

o 

is dense in X and if (/x — T) has a continuous inverse then /x is in the set 
of resolvents p{T) of T. All values of /x, that axe not in p{T) are called 
the spectrum cr(T) of T. 

Definition 6: If cr(T) is bounded and not empty, we define 

r^(T) = sup |/x| (6.19) 

and call {T) the spectral radius of T. 



In the following we use the property that X is a real Banach space (cf 
Rieder [24]). With this property we can formulate a sufficient convergence 
criteria for the expansion (6.18) in Theorem 1: 

o o 

Theorem 1: If T ^ [X] and X is a (complex) Banach space, the resolvent Rn 
is given by 

oo 

( 6 . 20 ) 

m=l 

if 1^1 > r^(T) holds. This series also represents if the series is con- 
vergent and |/x| = ra{T). The series is divergent for |/x| < Va- in 

The proof of Theorem 1 can be found in Taylor, p. 262. 

To proof the convergence of the series (6.20) and the existence of a solution 
for problem (6.16) with a Neumann series, we have to show according to 
Theorem 1 with /x = -hi that the spectral radius holds 

< 1 . ( 6 . 21 ) 

Since /x = 1/A (cf (6.14)) the convergence criteria (6.21) is equivalent with 
the condition that all eigenvalues of the eigenvalue problem (6.13) hold 

Ai>l V x = l,2,.... (6.22) 

The validity of inequality (6.22) for the integral operator T can be shown by 
considering three theorems of Kupradze [16] (pp. 362-364): 

Theorem 2: All eigenvalues A of the kernel of the integral operator T are 
real. 

Theorem 3: The eigenvalues A are not smaller than unity in absolute value. 
Proof: Prom Theorem 2 we have A E M. As a consequence the eigenvalues 
can be determined by the formula 
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^ a+(u,u) + g-(M,u) ^ 

|o_(«,u) — a+(«, u)| 

f (Jij (u) 6ij (u) dn + J (7ij (u) 6ij (it) dn 
I / (Tij{u)eij{u)dn - f aij(u)eij(u)df2l 

Q— i7_j. 

/ 4 j(«) Uj dr - J t-j(u) Uj dr 

_ 

I - / Uj dr - J t+j{u) Uj dr\ ■ 

r. r. 



(6.23) 



Because the energy bilinear forms and a_ are positive semidefinite we 
find 



|A| > 1, 

and the proof of Theorem 3 is complete. 



(6.24) 



We investigate the cases A = -fl and A = — 1: 

a) A = -f-1: According to formula (6.23) for the eigenvalues in terms of an 

energy quotient, for A = H-1 we must have either a+(n,tx) = 0 or 
a_(n,tx) = 0. From the physical point of view this is equivalent to a 
vanishing state of deformation in the interior or the exterior domain, 
that means a rigid body motion of the interior or the exterior domain. 

o 

Because the solutions are defined in the factor space X (cf (6.6)) the rigid 

o 

body motion is equivalent to a zero element in A. It follows that A = 4- 1 
is a regular value of the problem (6.11). 

b) A = — 1: Since the energy bilinear forms a(u, u) are positive semidefinite, 

negative eigenvalues are not possible according to the formula (6.23). 

Theorem 4: The eigenvalues A are simple poles of the resolvent. 

As a consequence of Theorem 4 all eigenvalues are isolated and A = -fl is 
not an accumulation point of the eigenvalues. 

In summary we can conclude from Theorem 2,3 and 4: 



Ai > Ai_i > . . . > Ai > 1. (6.25) 

As a consequence the convergence criteria (6.22) holds. 

Additionally with ju = 1/A for A = -fl the inequality (6.21) is valid and 
according to Theorem 1 we can compute the solution g by the convergent 
Neumann series 

oo 

(6.26) 

m=l 

With the initial notation of the Calderon projector (6.4) we obtain the Neu- 
mann series 
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oo . 

n(0 = = E (2 ^ ^ 

m=l 

From that a recursive scheme for approximations can be formulated 

by 

= (I (6.28) 

with the initial value 

= 0. (6.29) 

With the formulation (6.28), (6.29) we can implement a numerical scheme 
as shown in the following chapter. 



7 Numerical results 

First we want to study the convergence behaviour of the iteration rule (6.28) 
derived in Section 6. Therefore we analyze a Neumann problem, for example 
a thick plate under constant tension ao with a circular cutout (cf Figure 7.1). 




Fig- 7-1. Discretized plate under tension 



The surface of the plate is discretized by 224 triangular boundary ele- 
ments. With a trivial start solution we calculate the surface displacement 
field iteratively by using the expansion (6.28). 
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Iterations 

Fig. 7-2. Convergence of the Dirichlet data 



Fig. 7.2 shows two different error norms of the displacement field while 
increasing the number of iterations {k) with the definitions 



errl = 



uFEM(^P^ 



In the left part of Fig. 7.2 we compute a characteristic displacement vector 
Uh on the notch surface in P with the Neumann series and compare it with a 
reverence value generated by a FEM analysis of the same problem with 

a very fine grid ( 30237 degrees of freedom). In the right part we compute the 
L ‘2 norm of the numerical displacement field Uh on the whole surface of the 
plate and observe the variation from iteration (k) to (A: -f I). The parameter 
Cint denotes different upper bounds for the integration error (cf Section 5). 

Since we observe a good convergence behaviour of the Neumann series 
in the first example we can incorporate this iterative algorithm in the coup- 
ling method. In the next numerical test we analyze a two dimensional notch 
problem with domain decomposition under consideration of the compatibility 
conditions for the traction field derived in Section 4. Fig. 7.3 shows the macro- 




Fig* 7* a* Plate under tension with substructures and i?a 
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element in the near field of the notch and the discrization of the skeleton 
with triangular finite elements. For the numerical experiments we use 187 
fine grid elements for the discretization of the coupling interface Fc. As an 
error indicator we define a relative symmetry defect 



MSD = 



EEiESl(/»ii + MI 



100 (%) 



(7.2) 



to control the symmetry of the coupling operator H (cf Eq. (3.20)). q denotes 
the dimension of the quadratic matrix H with the elements {hij). A second 
error indicator to control the definiteness of H is the smallest eigenvalue 
\min • For the notch problem with an infinite long plate in two directions we 
can solve the problem with the Kolosov Mushelishwili representation. This 
leads us to an analytical solution for the stress peak value 



Oiexact = 4.7632, (7.3) 

which is used as comparison with the numerical results, see Hahn [10]. To 
compare the numerical and the exact values of the stress concentration we 
define the relative error 



_ |o^nwm ~ OCexact] /rr 

Sa — • V* 

O^exact 

As shown in Table 7.1, we have a high accuracy for the stress peak after 38 
iteration with a deviation of only 0.26%. 



Table 7.1. Stress peak, symmetry defect, lowest eigenvalue for notch problem 



iter 


1 


10 


20 


25 


30 


36 


37 


38 


OCnum 


0.54 


3.53 


4.32 


4.49 


4.61 


4.73 


4.74 


4.7505 


[%1 


88.7 


25.9 


9.30 


5.74 


3.21 


0.70 


0.49 


0.26 


MSD [10“^] 


5.05 


5.45 


6.17 


6.36 


6.50 


6.61 


6.63 


6.64 


Amin [10-"] 1 


0.99 


2.83 


3.19 


3.26 


3.30 


3.335 


3.338 


3.341 



We can control automatically Amin ^nd MSD^ so that we can control local 
stability of the method providing high accuracy and reliability. In Table 1 one 
can see the dependence of the minimum eigenvalue and the symmetry defect 
of the problem on the number of iterations; Amin increases with the number 
of iterations in the first phase rapidly while the symmetry defect remains on 
a low level between 4 • 10“^ and 7 • 10~^. This allows us to define a priori 
an upper bound for MSD and a lower bound for Amin- Then the algorithm 
automatically increases the fine grid density and the number of iteration 
steps for the Neumann series until the given requirements of symmetry and 
stability are fulfilled. 

For further numerical investigations and an extension of the coupling 
method to three dimensional problems we refer to [14], [34] and [13]. 
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8 Summary and conclusions 

The numerical experiments have shown that the coupling method gives the 
possibility to obtain a high resolution of stress concentrations with only a 
coarse global FE discretization by using singular integral operators. With 
the Neumann series approach we are able to control the numerical charac- 
ters of the coupling operator as symmetry and stability within an adaptive 
algorithm. 

For future developments the method can be used to analyze nonlinear 
problems in continuum mechanics iteratively and to solve coupled problems 
in general field theory. 
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Feist Algorithms for the Solution of 
Pseudodifferential Equations 

D. Berthold and B. Silbermann 



1 Introduction 

The paper deals with the fast solution of pseudodifferential equations Au = f 
of order zero given on a certain curve F. Such equations often arise from 
boundary value problems. Let, for example, i? C be a finite domain having 
a smooth closed boundary F and let us given a partial differential equation 
(or a system of such equations) in 17 together with boundary conditions 
on F. The Boundary Element Method (BEM) being based on an integral 
formulation of the original problem is a powerful tool for its solution. Since 
we have to discretize only the boundary T, the BEM is even more efficient 
than the Finite Difference Method or the Finite Element Method for some 
classes of problems. 

In many cases the BEM produces an integral equation having a kernel 
singularity of the type a so-called Cauchy type singular integral equation 
given on the smooth closed curve F 

a{t)u{t) + ^^ [ f k{t,T)u{r)dT = f{t), t £ F, (1.1) 

TTi Jr T -t Jr 

where fc(t,r) is smoother than {r—t)~^. The present paper is mainly devoted 
to the solution of such kind of equations. 

As a second example, we consider the case when F is an interval of the real 
axis, say F = [—1,1]. Then we shall call equation (1.1) the airfoil equation^ 
because it plays an important role in aerodynamics, but also in fracture and 
crack mechanics and elasticity theory. 

There is an enormous number of papers dealing with approximate meth- 
ods for solving (1.1) and with methods for the solution of the algebraic sys- 
tems of equations arising from the approximation methods. The most familiar 
approximation methods are Galerkin, collocation, and quadrature methods. 
However, from these methods there always arise algebraic systems having 
unstructurized and non-sparse coefficient matrices. Thus, for the first look 
we need O(n^) operations in order to get n unknowns of the approximate 
solution. But in this situation, the multigrid methods developed in the last 
15 years prove to be an advantageous alternative to the direct methods, see, 
for example, [11, 12, 22, 2]. Using multigrid strategies we can reduce the 
computational complexity to 0(n*^) operations. 
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Moreover, one can develope strategies whose amount of computational 
work is only 0(n log n). Such a strategy being based on the Fast Fourier 
Transformation was described in [1] for the first time. In this paper, equation 
(1.1) with r being a smooth closed curve has been solved by a special ap- 
proximation method with trigonometric ansatz functions and its convergence 
rate has been shown to be optimal in the scale of the usual Sobolev spaces 
H®(r), 5 G E. Here and in the following the term optimal means that if Un 
denotes the approximate solution oi Au = f the error \\u-Un\\w{r) behaves 
like for all t : to < t < s with a certain to ^ E, supposed that 

/ G Unfortunately, we can get error estimates only for to >1 from 

the strategy in [1]. On the other hand, just in the case of the BEM, one is very 
interested in a good convergence behaviour of the approximate solution Un 
in Sobolev norms with t < 0, because the error of the approximate solution 
of the original boundary value problem can be estimated (even in uniform 
norms in the interior of i?) by the H^(r)-norm on the boundary F. 

In the author’s paper [4], there is proposed the so-called corrected collo- 
cation method as an alternative to the qualocation method (see [7, 23, 24]) 
for equation (1.1). Both methods aim at the improvement of the approx- 
imation error in ’’negative” Sobolev norms. However, fast algorithms for 
the solution are not available. In the present paper we give an algorithm 
called Parametrix-Galerkin method being based on the corrected colloca- 
tion method and on the strategy of [1]. Its computational complexity is only 
0(n log n) and it has a better convergence behaviour in ’’negative” Sobolev 
norms than the collocation method. Further, we describe some other feist al- 
gorithms for equation (1.1) given on smooth closed curves F or on the interval 
[— 1, 1] and prove their convergence in several scales of function spaces, which 
we define in Section 2. In Section 3 we deal with special properties of equa- 
tion (1.1). The ’’classical” approximation methods are described in Section 4, 
whereas in Section 5 the fast algortihms are proposed. All of the algorithms 
consist in two steps. In the first step, a parametrix for the pseudodifferential 
operator A is used in order to get an approximate solution on a fine grid. In 
a second step this solution is corrected in a certain manner by the solution 
on a coarse grid. Mostly, we only give an overview of the algorithms since 
the results are partly published yet, see [2, 3, 4, 5]. The error estimates in 
Holder-Zygmund norms (Subsection 5.3) are new. With some remarks on 
further investigations (Section 6) and numerical results (Section 7) we close 
our paper. 



2 Scales of Spaces 

Let us introduce some function spaces. All what follows is well-known and 
can be found, for example, in [16, 26, 27]. For F being a smooth closed curve 
let L^(r) be the Lebesgue space on F provided with the norm 
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l|w/’||L2(r) ^j^\ur{x)f\dx^ , 



where \dx\ refers to the arc length diflFerential. With the help of the (regular) 
1-periodic parametric representation 7 : E T of the curve F we have a 
one-to-one correspondence between functions u r defined on F and 1 -periodic 
functions u defined on E. We put u{x) = ur{'y{x))^ x € E. We denote by 1 ? 
the Hilbert space of all square integrable 1-periodic functions endowed with 
the scalar product and the norm 



|l 2 := yj{u,u), 




u{x)v{x)dx, 



respectively. Note that ||ur||L 2 (r) = Further, let C be the Banach 

space of all 1-periodic continuous functions on E and the set of all 1- 
periodic infinitely differentiable functions, where 



||u||c max{|u(x)| : x € E} = max{|u(x)| : x G [0, 1)} 



defines the (maximum) norm in C. We denote by H^,s E E, the periodic 
Sobolev space of order s, i.e. the completion of with respect to the norm 

IHIh. , (2.1) 

V kez* / 



where the expressions (u^ek) are the usual Fourier coefficients of the func- 
tion u with respect to the orthonormal system {ek}k£Z of the trigonometric 
polynomials ek{x) = and Z* := ^\{0}. Here are some batsic properties 

of the periodic Sobolev spaces. 

- The H" are Hilb ert spa ces with the scalar product 

(u,i))h» = Keo)(t),eo) + Efc€Z*I^P*Kefc)(v,efc). 

— The space H”^, s E E, is the dual space to and the relation 
II^IIh— = sup{|(u,it;)| : w E H^,||t/;|1h- = 1} holds. 

— If t < s then is a dense subset of and |1 u||h* < II^IIh- for each 

u E H^. Moreover, the imbedding C is compact. For this reatson, 
the family proves to be a Hilbert scale. 

- If s > I then is continuously imbedded in C. Furthermore, forms 
an algebra on which the operator al of multiplication is continuous. 



Let us introduce another scale of functions spaces. We denote by V the 
operator of (distributional) differentation and put A\u{x) := u{x-\-h) —u{x) 
and A^u = Aj^{A^~^u), k = 2,3,... With the help of A^ we define the 
Holder-Zygmund seminorm by 



M 7 := 



SUPh>0 • 



SUPft>o 



h'^ 

II^MI 



0 < 7 < 1, 

7= 
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and the Holder-Zygmund space W, s = m + 7 , m € N U {0}, 0 < 7 <l,by 
--{ueC: e C; fc = 0, m; [V^u\^ < 00 } 

endowed with the norm 

m 

■.= Yl\\V>^n\\c + [V^u\,. 

k=0 

Here we list some properties of the Holder-Zygmund spaces. 

- The 3ie Banach spaces. 

- If 0 < t < s then is a dense subset of V} and ||u|| 7 ^t < ||u||7/- for each 
u £71^. Moreover, the imbedding C is compact. 

- For each s > 0, the space 7i^ forms an algebra on which the operator al 
of multiplication is continuous. 

Finally, we are going to introduce a scale of (nonperiodic) spaces defined 
on the interval (-1, 1). Let p{x) = (1 - x)^(l -f x)^, a,P> -1, be a Jacobi 
weight function, x £ (-1, 1), and 1) be the Hilbert space of all 

square integrable functions on (—1,1) with respect to the Jacobi weight p 
endowed with the scalar product and the norm 

u[x)v{x)p{x)dx^ 

respectively. We denote by C[a, 6 ] the Banach space of all continuous func- 
tions on the interval [a, b] endowed with the maximum norm 

l|w||c[a, 6 ] := max{|u(x)| ; x e [a, 6 ]}. 

Further, let be the sequence of the orthonormal Jacobi polynomials 

(with respect to the Jacobi weight p) with positive leading coefficients and 
degpn = For any real number s > 0 we define the Sobolev type subspace 
g of L‘p as follows 

:= {w e : ||u||l 2 , < oo}, 

where 

) i 

2 

( 2 . 2 ) 

The expressions (ti,p^)p can be interpreted as the Fourier coefficients of u 
with respect to the orthonormal system {pl)k>o of the Jacobi polynomials 
. Note that = L^. Here we list some properties of the spaces ^ (see 

[3]): 

- The ^ are Hilbert spaces with the scalar product 

= iu,Po)p{v,Po)p + E^l k^^{u,pi)p(‘^^Pk)p- 
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- If 0 < t < s then is a dense subset of and \\u\\i^2 ^ < \\u\\i^2 ^ for 

each u £ Moreover, the imbedding ^ C ^ is compact. For this 
reason, the family {L^ ^}5>o proves to be a Hilbert scale. 

- If s > I then the functions u £ ^ are in C[a, 6] for all [a, 6] C (-1,1). 

Moreover, if a,^ < \ then ^ is continuously imbedded in C[-l,l]. 
(More precisely, there is a continuous representative in the equivalent class 
containing u.) 

If s € N then the spaces ^ can be described in another and very simple way 
([3]). Indeed, the space ^ is the same as the subspace of all functions u G 
having the property that \\T>^u • ip^\\i,2 is finite for all A: = 0, ..., s, where ip is 
defined by ip{x) := \/l - Moreover, the norm ||u||l 2 ^ is equivalent to the 
norm \\V^u-ip^\\i^2. 

3 Pseudo differential equations 

3.1 Pseudo differential equations on smooth closed curves 

Let r be a smooth closed curve in the complex plane and let be given on F 
a pseudodifferential equation of order /? = 0 

Arur = fr, /r € L‘^(r). (3.1) 

The Cauchy singular integral operator on F 

Srurit) = ^ f t e r, (3.2) 

7Tl J p T — t 

is an example of such an operator. Let T := {2; G C : j2;| = 1} be the 
unit circle. The action of Sj on L‘^ (T) can be described (in terms of Fourier 
coefficients) by 

^ Ylk>o - E*<0 

Obviously, St has the property = /. For this reason the operators Ft = 
{I-\-St )/2 and Qt = (I — St )/2 prove to be projections on L^(T). Note that, 
for given functions ax and 6x on T, the singular integral operator ax/ 4- 6x5x 
equals cxPtH-c/xQt, where cx == axH-6x, dx == ax — 6x- Using the well-known 
Agranovic representation of any periodic pseudodifferential operator of order 
0 

Ap = cxPr + djQr + ^5 (3-3) 

with K being a certain compact operator, we can restrict our investigations 
to the case when JT = T. Thus, we only have to deal with singular integral 
operators with compact perturbation At = ax/ + fexS'x -h K. Finally, using 
the parametrization 7 : M -> T, y we get 
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SjUj{t) — ^ I 
Jo 

=/■ 

Jo 



^2niy _ ^2nix 



dy 



u{y) [1 + i cot7r(x - y)] dy =: Su{x), x E [0, 1), (3.4) 



where u{y) = Thus, in the sequel we shall investigate the singular 

integral equation 



Au = au-{-bSu + Ku = ^ ( 3 . 5 ) 

being equivalent to (3.1). Note that S acts on trigonometric polynomials 
according to the rule Sck = Ck, k>0 and Sek = -caj, A: < 0. 

For two Banach spaces X and Y let £(X, Y) be the Banach space of all 
bounded linear operators A : X Y and /C(X, Y) the set of all compact 
operators K : X Y. Further, the symbol G£(X,Y) denotes the open 
subset of £(X, Y) of all operators A E £(X, Y) with im A = Y having an 
inverse A~^ E £(Y,X). If X = Y we simply write £(X), G£(X), and /C(X), 
respectively. 

We are going to study some properties of A in Sobolev spaces and H51der- 
Zygmund spaces. Put c := a + 5 and d := a - b and denote by [a, S] the 
commutator aS — Sa. 



Lemma 3.1 ([4, 19, 20, 21]). Let 6 > 0 be an arbitrarily small real num- 
ber. 

(i) If a eW, r> 0, then [a, S] 6 X:(H*, H'’) for all s>\. 

(a) If a g IV, r > 0, then [a, 5] € KlfH* for all s > 0. Moreover, if 
r < 1 then [a, 5] g K,{Ii‘,W) for all s>0. 

Denote by P and Q the analogue of Pr and Qt in the space . They can 
be defined by Pu = 2fc>o(n, e*) e* and Q — I — P. Obviously, P and Q act 
on trigonometric polynomials as 



Pei 



ei, I > 0, 

0 , 1 < 0 , 



Qei 



0 , 1 > 0 , 

ej, / < 0. 



(3.6) 



It is easy to see that aS — Sa = 2QaP - 2PaQ. Thus [a, S] is compact if 
and only if both QaP and PaQ are so. 

Lemma 3.2 ([21]). 

(i) If c^d £ H^, r > |, are invertible then B := c~^P -h d~^Q E £(H^) for 
all s <r. 

(ii) If c,d £ T-P, r > 0, are invertible then B £ for all s : 0 < 5 < r. 

The operator B is a so-called parametrix for the operator A. Define T := 
I — BA. Since T = c~^ {QcP — PdQ) -f- d~^ {PdQ - QcP) — BK^ one has that 
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(i) T e ^(H*, H’-) for all 5 > i if K € ^(H*, W) and 

(ii) T € for all s > 0 if K 6 K,{n\W-^) 

under the assumptions of Lemma 3.2. 

3.2 The airfoil equation on the interval 

Let us consider the Cauchy-type singular integral equation with constant 
real coefficients a and 6^0 and an additional integral term with logarithmic 
singularity on J = [— 1, 1], the so-called airfoil equation, 

av{x) + -[ ~ ~ / log\x - y\v{y)dy + f k{x,y)v{y)dy - f{x), 

TT y_i y - X 7T J_i 

(3.7) 

X € (—1, 1). For simplicity assume that -i- 6^ = 1. Further, z/ is a complex 
constant, and k and / are given continuous complex- valued functions on Jf x J 
and Jf, respectively. 

Following [8, 15, 21], we form a Jacobi weight function a{x) = {l-x)^{l-\- 
xY in dependence on the coefficients a and 6, where a = g, (3 = X~ - 
g = X arg(a — ib) £ (—1, 1), and A“^, A“ are two integers being chosen such 
that — 1 < < 1. Doing this, the operator Aq aal -h bSjal possesses 

mapping properties which are very advantageous for applying approximation 
methods. Here I denotes the identity and 5j is the Cauchy singular integral 
operator defined by 

= xs(-M). (3.8) 

7T y -X 

Lemma 3.3 ([8, 15, 18]). The operator Aq turns out to be a Fredholm op- 
erator in the pair of weighted Lebesgue spaces (L^,L^), /i = cr~^ , with 
K := ind Aq = — (A*^ -I- A“) € {—1,0, 1}. Furthermore, Aq maps polynomials 
into polynomials; more precisely, we have Aqp^{x) x € (—1,1). 

The operator Aq := apl - bS^pI fulfils the inverse relation Aqp^{x) — 
Pn+ni^)^ X G (-1,1). 

Having this in mind, we shall consider in the sequel the equation 

Au = Aqu + vWu -h Ku = / (3.9) 

being equivalent to (3.7), where Wu{x) = /^^log|x - y\u{y) a{y) dy, 

V = ua and Ku{x) = j!_ik{x,y)u{y)a{y)dy. Note that A is a Fredholm 
operator in the same pair of spaces as Aq and that ind A = ind Aq. 

One particalar case of equation (3.7) is closely connected with the so- 
called Chebyshev polynomials of the first, second, third, and fourth kind. 
Indeed, if a = 0 then one gets that \a\ = \/3\ = Let II ^ denote the set of 
all algebraic polynomials of degree less than or equal to n - 1. 
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Lemma 3.4 ([2, 9]). If \a\ = |^| = | then the operator W is a bounded 
linear one from ^ into L^ for all s >0 and it is compact from into 
C[— 1,1]. Moreover, W sends polynomials from II n into 77n+a+/?+i. 

If a / 0 then the investigation of the properties of W is more involved. 
For this reason, we restrict ourselves to the two cases when a = 0 or i/ = 0, 
respectively. Furthermore, in the sequel we only deal with = A“ = ac = 0. 
For 0 see [3] . 

4 Approximation methods 

4.1 Approximation methods for periodic pseudodifFerential 
equations 

For a natural number n, the operator Pn is defined for any u E 
by PnU - where := {-n, -n -I- l,...,n}. Note that 

Pn is the orthogonal projection onto the set of the trigonometric poly- 
nomials Tn := lin{ejk, k E Z^}. An equivalent definition for Pn is that 
{u — PnU^'Kn) = 0 Vu E VtTh € Tn- Furthermore, we define the inter- 
polation operators Rn \Il Tnhy Rnu{xnj) = u{xn,j), j € Zn, where Tl 
denotes the class of all 1-periodic bounded and Riemann integrable functions 
on E and Xnj = -h 1). In the following we shall denote by C several 
positive constants. 

Lemma 4.1 (cf. [17, 20]). 

(i) There exists a positive constant C depending only on t and s such that for 
any u E the following estimates hold 

\\u — Pnu\\ii^^ < C \\u\\h», \ft : t < s, 

\\u-Rnu\\H* < c \\u\\h-, vt : 0<t<s; s>^. 

If t = s then both norms on the left-hand side tend to zero (i.e. they are 
not only bounded). 

(ii) There exists a positive constant C depending only on t and s such that for 
any u the following estimates hold (n>2) 

Wu-PnuWn^ < C n^“^logn llull'H*, Vt : 0<t<s, 

\\u-Rnu\\'^t < C n^“^logn ||u||^-, Vt : 0 < t < s. 

Now, let us describe some approximation methods for the pseudodif- 
ferential equation (3.5). We restrict our further investigations to equa- 
tions with continuous coefficients c and d and compact operators Ku{x) = 
Jq k{x, y)u{y)dy, x E [0, 1), having certain smoothing properties, because the 
fast algorithms in Section 5 below only work in this case. More precisely, we 
make the following assumptions on equation (3.5), which are assumed to be 
true throughout this paper: 
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(i) c,d G r > and K e IC{L\W) in the ’’Sobolev case” and 

(ii) c, d € r > 0, and K G IC{C,'H'^) in the ” Holder-Zygmund case”. 

Further, we suppose that c{x) ^ 0 and d{x) ^ 0 for all x G M and that the 
winding number of c/d around the origin [argc(x)/d(x)]®Zo equals zero. Then 
cP -f dQ is in G£(H^), t <r under condition (i) and in GC{7i^), 0 <t <r 
under condition (ii), [20]. 

The Galerkin method defines an approximate solution i/n ^ 7^ by 
(^yn,7Tn) = (/,7Tn) Vtt^ G 7n, Or, equivalently, by PnAyn = Pnf, (4.1) 
whereas the collocation method determines Zn ^Tnhy 

Aznixnj) = f{xnj), j € Or, equivalently, by RnAyn = Rnf- (4.2) 
The quadrature method consists in finding a ^ 7^ by 
o(^nj)7^^n(^n,j ) d(Xyij)Q^,T,(Xn j) “h KnZn{Xn,j) — j ^ ^ri5 (4-3) 

where 



KnZn{x) — 2n+l SibeZn 




kn{x,y)zn{y)dy, 



kn{x^ •) Rnk{x, •) for all x G [0, 1). 



The last equality is a consequence of the quadrature rule 




1 

2fi -f" 1 






being exact for u G Tin- Occasionally, we shall refer to both the collocation 
and the quadrature method by ’’approximation methods of collocation type”. 



Theorem 4.1 (cf. [17, 20, 21]). If n is sufficiently large and the corre- 
sponding assumptions are fulfilled then each of the approximation methods 
defined above possesses a uniquely determined solution yn and Zn, respec- 
tively. Moreover, we have the error estimates given in the table below, where 
u denotes the exact solution of equation (3.5). Remember that r denotes the 
degree of smoothness of c,d, and the image of K. 



Sobolev case 



Assumptions 



AgG£(H"), /GH", 5<r 




\\u-yn\\w <Cn^-^ WfWus 
t < s 



Holder-Zygmund case 

A£GC{W), fen\s<r 
If s < 1 then 5 = r is allowed. 
iM-ynll-H' < c n*"® logn ||/||h» 
0 < < < s 



Collocation 

^ Wu-ZnWnt 

type 

methods 



<C n* ^ I1/1Ih. ||m - ZnWnt <C ^ logn ||/||w. 

0<<<s;s>| 0<f<s 
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Proof. Let us sketch the proof for the Holder-Zygmund case. For details 
see [21]. At first, we consider the Galerkin method. The assumptions ensure 
that we can factorize the coefficients c = c+c_ and d = d-d+, such that 
(c^.ek) = (d+,efc) = 0 for all A: < 0 and (c_,ejb) = {d-,ek) = 0 for all 
A: > 0. We define the operator M — M+M-, where = Pc+P + Qd^Q 
and M_ = Pc-P -f Qdj^Q having the inverses = Pc^^P + Qdr}Q 

and MZ^ = PcZ^P + Qd^^Q. Note that e GC{W). One can 

show that A = M+M_ H- T with T £ Obviously, the equations 

Au = f and M^^Au = {M--{-M^^T)u = are equivalent. The property 

PnM^^ = PnM^^Pn yields that the approximate equations PnAun = Pnf 
and Pn(M_ -f M^^T)un = PnM^^f are also equivalent. Moreover, using 
additionally the property P^M^^P^ = the last equation equals 

(M_ + PnM;^T)Un = PnM;'/. 

Let us proceed with the collocation method. By the property of the in- 
terpolation operator Rn{g • h) = Rn{g • Rnh) for any functions g^h ^ TZ^ we 
can assume without loss of generality that d = 1. Define I and 

M_ = Pc-P -f Qc^^Q. In an analogous manner as above we again obtain 
A = M+M_ 4- T with (another) T E the equivalence of Au — f 

and (M_ -j- M^^T)u = and the equivalence of RnAun = Rnf ^nd 

(M_ + RnM:^^T)Un = PnM+7. 

Let Ln be one of the projections Pn and Rn- 
Since \\LnM^^T - ->> 0 as n oo, it is easy to show that 

M_ -h LnM^^T is invertible for all n > no and that the sequence {(M_ -f 
LnAf^^T)“^}n>no is Uniformly bounded in Furthermore, we have the 
identity 

(M_ + LnM^^T)(u - Un) = M.u + LnM-^Tu - LnM+ V 

= {I- Ln)M-^f 4 {Ln - I)M-^Tu. 

Putting these things together and using Lemma 4.1 we get the ctssertion. 



4.2 Approximation methods for the airfoil equation 

Let n be a natural number and let p refer to one of the Jacobi weights a 
and ju. We denote by P^ the orthogonal (with respect to the scalar product 
(v)p) projection P^ \ u Pn having the property 

{u - P^u^TTn) = 0 Vu € VTTn € 17^. Further, let : 72,^ -> Un be 
the interpolation operator R^u{x^j) = u{x^ j), j = 0, ...,n - 1, where TZ^ 
denotes the class of all bounded and Riemann integrable (with respect to 
the weight p) functions on J and the x^j are the zeros of the orthogonal 
polynomial . 

Lemma 4.2. There exists a positive constant C depending only on t and s 
such that for any u E ^ the following estimates hold 
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Vt : 


0 < 7 < s. 


If |a| = 1/31 = 1 then 






\\u-RM\lI, < C' n*-* ||u||l2 ^, 


Vt : 


0 < 7 < s; s > -|, 


whereas in the remaining cases we have 






h-P-nUhlt - I|w||l2_., 


Vf : 


0 < 7 < s; s > 1. 



If t = s then all norms on the left-hand side tend to zero (i.e. they are not 
only bounded). Furthermore, for all Un E 77^ the Bernstein type inequality 



hold 















0<q<t. 



Proof The first and the very last relation obviously follow from the definition 
of and the L^^^-norm. For Jacobi weights p with |a| = |/7| = the second 
estimate is due to the authors [3]. The third one has been recently proved by 
Capobianco and Mastroianni [6]. 



Now, we are going to describe some approximation methods for the airfoil 
equation (3.9). For the same reason as in the previous subsection, we make 
the smoothness assumption K E £(L^,L^ ^), r > ^. Sufficient conditions for 
this property are, for example, 



l|^(^y)||L2 ^ < C'l for all 1 / E J and ||A:(x, •)||l 2 ^ < C 2 for all x E J. (4.4) 

Because we restrict our investigations to operators Aq having index zero, 
Ao is invertible in each pair of spaces (L^ ^,L^ J, 7 > 0, having the inverse 
Aq^ := see [3]. At first we consider the case when 1 / = 0. 

The Galerkin method defines an approximate solution y^ E 77^ by 



PiiAyn = PJtf. 



Further, the collocation method determines Zn E 77n by 



(4.5) 



KAzn = Kf. 



(4.6) 



and the quadrature method by 

77{J(Ao+7^n)^n = 77{:/, (4.7) 

where KnZn{x) = YJkZl = J^kn{x,y)zn{y)(Tiy)dy, 

kn{x, •) := Rn^{x, •) for all x E J, and the an,k are the Christoffel numbers 
of the orthogonal polynomial i.e. the weights in the Gaussian quadrature 
rule /ju(x)cr(x)dx « YJkZl '^{Xn,k)^n,k being exact for u E 772n- 
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Theorem 4.2. Let 1 / = 0 in (3.9). If A e f G 0 < t < 

s < r, then for all sufficiently large n each of the approximation methods de- 
fined above possesses a uniquely determined solution yn and Zn, respectively. 
Moreover, we have the error estimates 

ll« - J/nlkj , < C n*"* 11 /IIlj,,, Vt : 0<t<s. (4.8) 

If |a| = |/3| = I then 

Vf : 0<f<s; s>^, (4.9) 

whereas in the remaining cases we have 

Ik - < c II/IIlj,., Vt : 0 < i < s; s > 1. 

where u denotes the exact solution of equation (3.9). 

Proof. The proofs can be found in [3]. For arbitrary Jacobi weights, however, 
only a weaker estimate is given. Using the results of [6], we also get the 
optimal estimate in this case. Furthermore, approximation methods for the 
airfoil equation are also considered in [13]. 

Let us, for example, sketch the proof for the quadrature method. The 
other proofs are analogous. At first, one verifies the stability of the sequence 
{R!)^{Ao -f Kn)Pn} or, what is the same, the invertibility of Aq -I- R!)^Kn for 
all n > riQ. This can be done simply by showing that ||(Ao K) — (Aq -f 
R!):Kn)\\i ,2 = \\K — RI^^KnWi,^^ tends to zero as n ^ oo. Now the 

error estimate immediately follows from the relation {Aq -h Rf^Kn){u — Zn) = 
Aon + Rff^KnU - RU = if - Kf) + iKKr^u - Ku). 

Since the estimate for \\{K — R^Kn)u\\i 2 was shown under the stronger 

condition r > 1 and only for |a| = |/31 = ^ in [3], we are going to give the 
proof in the general case in the following. Let u G L^. From Lemma 4.2 we 
get that 

\\{K - KKn)u\\^.^ < ||(/ - K)Ku\\^i^ + n‘ \mK - X„)u|kj. 

Applying the Gaussian quadrature rule Jjf{x)fjL{x)dx « fi^nj)Pnj 

being exact for / G 772n, we obtain 

WR'^.iK - KnjuWli <C sup I f[k{x,y) - kn{x,y)]u{y)cr{y)dy 
^ leJ I./J 

<C sup \\{I - Rl)k{x,-)\\h ||m||l 2 . 

*6JI 

Thus, 
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from which the estimate 

\\{K - ^ < C n*-" 11 «||l 2 , Vt : 0<t<r, 

follows, where r > |, if |a| = \^\ — and r > 1, otherwise, heis to be 

assumed. 

Now, let us consider the case when i/ / 0, a = 0. For details see [2, 3, 9, 
10, 14]. Here we always have |a| — — Note that the quadrature method 

in this case is defined by 

K{Ao^vW^Kn)Zn = Kf. 

In particular, we do not need to apply an additional quadrature rule because 
of the mapping properties of W with respect to polynomials. 

Theorem 4.3. Let a = 0 in (3.9) and let the assumptions of Theorem J^.2 be 
fulfilled. Then for all sufficiently large n each of the approximation methods 
defined above possesses a uniquely determined solution pn and Zn, respec- 
tively, and we have the error estimates (4-8) and (4-9). 



5 Fast algorithms 

Seeking an approximate solution Un ^ Tn of the equation (3.5) consists ac- 
tually in finding n Fourier coefficients {un,ek), k e Zn, which are related 
in some sense to the corresponding Fourier coefficients {u,ek) of the exact 
solution u. 

Studying the classical approximation method from Section 4 more closely, 
one find that the best approximation can be obtained using the Galerkin 
method. Moreover, the approximation error is optimal in each Sobolev norm, 
that means \\u — Un\\w < C I1/||h-, t < s. Consequently, each summand 
\k\^\{u — Un,ek)\ (which is the error of the A:-th Fourier coefficient, apart from 
the factor |fc|*) of the series (2.1) again fulfils such an optimal estimate. 

In the cases when collocation type methods are used we do not have this 
optimal convergence on the whole Sobolev scale. It is true that the error 
rate is optimal if we measure it in norms with t > 0. However, for nega- 
tive t we have no more increase in the convergence rate. Nevertheless, by 
an observation of the authors in [4], the higher Fourier coefficients of the 
approximate solution also converge optimally in cases when t < 0, whereas 
only the lower Fourier coefficients cause the worse convergence behaviour in 
’’negative” Sobolev norms. Indeed, if t < 0 and m denotes a fixed positive 
integer less than n, we have 

||u — ^ ll-fm(^ ~ H" ||Qm(^ ~ '2^n)||H* 

< \\Pm{u-Zn)\\w 4 - Cmf\\u-Zn\\v^ 

< \\Pm{u-Zn)\\w + C II/IIh- 



(5.1) 
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Since ||tx-^n||H* is only of the order 0(n ®), we conclude that \\Pm{u-Zn)\\w 
can not be better than 0(n~^). 

Another observation concerning the higher Fourier coefficients of the col- 
location solution Zn = {RnARn)~^ Rnf (in the case when t > 0) is due to 
Amosov [1]. In his paper he states that if we replace {RnARn)~^ by RnBRn, 
where B is the parametrix to A (see Lemma 3.2) then the higher Fourier coef- 
ficients still converges optimally. We only have to assume that the coefficients 
c and d and the image of K in (3.5) are smooth enough. 

Further, using an appropriate correction of Zn one can also achieve opti- 
mal convergence for the lower Fourier coefficients and, consequently, for the 
approximate solution Un itself. Amosov’s correction is based upon a small 
quadrature scheme with a modified right-hand side. Of course, we do not 
have optimal convergence in all Sobolev norms || • ||h*, but at least for t 
varying in [to, s] with a certain to > 

Amosov has been developed a fast algorithm for solving equations of the 
type (3.5). First of all, the application of RnBRn instead of {RnARn)~^ 
to Rnf reduces the amount from, in general, O(n^) to O(n^). But, in the 
present situation we can use the Fast Fourier Transformation, which reduces 
the amount once more to O(nlogn). 

Originally, the authors have been used the above observations for an- 
other goal. In their paper [4] they have been proposed corrected collocation 
type methods. The purpose of these methods is to improve the convergence 
rate in ’’negative” Sobolev norms. Having (5.1) in mind, the lower Fourier 
coefficients of the collocation type solution Zn are replaced by the Fourier co- 
efficients of the solution of a small Galerkin scheme. Thus, we have somewhat 
more expense in computing, but it results that the convergence rate can be 
improved, say, by 0(n^/^) (in dependence on the size of the Galerkin scheme). 
For the corrected collocation type methods we also need some smoothness 
assumptions on c, d, and the image of K. 

In a further paper [5], the authors have been brought these two things 
together, i.e. they have been established a method, having both a good con- 
vergence behaviour in ’’negative” Sobolev norms and a low computational 
complexity. Moreover, the case of spline approximation methods has been 
considered (see also [4]). 



5.1 Amosov’s fast algorithm 

At first, we briefly describe Amosov’s method [1]. Let us denote the com- 
plementary projection of Pn by {= I — Pn) and let m be an integer less 
than n. The main idea consists in the calculation of the higher and the lower 
Fourier coefficients of an approximation Un € 7^ of the exact solution u by 
two different methods. 
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Amosov’s Method or Parametrix-Quadrature Method 

Un •— Pm'^m “I" Qm^n ^ Tn? (^*^) 

where Zn ^ Tn is defined by 

Zn := RnBRnf (5.3) 

and Wm £ Tm is the solution of the quadrature scheme 

Rm{(^P H" dQ -f- Kjji)Wfn — RfnQ^ 9 f ~ {^P “I" dQ)QfnZn- (^•^) 

The following algorithm calculating Un with an amount of only 0(n log n) 
operations is due to Amosov. For details see [1, 5]. 

Let Fn = be the Fourier matrix of order 2n -f 1, the 

inverse of which can be written as F~^ = If is 

well-known that the multiplication of Fn or F~^ by a vector (the so-called 
Fourier transformation and inverse Fourier transformation^ respectively) can 
be realized with 0(n log n) operations. For a function g we denote by gn the 
vector, whose entries are the values of g at the points Xnj, j ^ and by 
gn the vector containing the Fourier coefficients (g^ek)^ k £ Zn- If g £ Tn^ 
then gn = Fngn- 

The computation of Un can be split up into three steps. Firstly, the com- 
putation of Zn^ secondly, the computation of the right-hand side Rmg, and 
thirdly, the solution of the collocation scheme (5.4). The first one can be done 
as follows 

= F-\C-^FnI+ + (5.5) 

^ f 0 0 \ _ 

where C = .-^n = Wi ~ tZ,Cn = diagj^^^Cnj, D„ = 

y 0 /n+1 J 

and In is the unit matrix of order n. The second step consists 
in 

9m — fm ~ {Cmlm^nFnln "b PmIm.nFnIn (^-b) 

where is the diagonal matrix with the entries [IJ^]j^k = 1? if j = 
k £ Zn\^m, and [In]j,k — 0 , otherwise, j,k £ and Im,n is the 
(non-quadratic) matrix [Im,n]ij = 1, if Xm,i = Xnj, and [Im,n]ij = 0, 
otherwise, i £ Zm]j € ^n- (Here we suppose that (2n-f l)/(2m-|-l) is an 
integer.) Finally, the third step is the solution of the mxm algebraic system 
of equation. In general, we need O(m^) operations since the matrix is neither 
sparse nor well-structurized. However, both in the first and in the second step 
there occur only diagonal and Fourier matrices. Thus both can be realized 
with O(nlogn). Choosing m ^ ^ then the calculation of Un needs really 
only 0(n log n) operations. Here and in the following the notation ^ bn 
means that there are positive constants C\ and C 2 such that for all n the 
relation C\ < On/bn < C 2 holds. 
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5.2 Error estimates in Sobolev spaces 



In this subsection we suppose that the right-hand side of equation (3.5) is 
an element of 5 > | and that the operator A is in G£(H^). Then the 

uniquely determined solution (being always referred to by u in the following) 
is again in Since ||u-Un||H* < ||T^m(^-^^n)||H‘ + ||Qm(^-'^^n)||H* we are 
able to investigate the convergence behaviour of the higher and of the lower 
Fourier coefficients separately. 

Lemma 5.1. Let r > s > ^. Then for sufficiently large n and m we have 
the estimate 






C ^ -f mf ||/||h-, 0 < t < s, 

C (m^ II/IIh«, ^<0. 



Proof. For Q <t < s the proof is due to Amosov ([1], Lemma 5.1.). The case 
t < 0 has been proved by the authors in [5]. 

Lemma 5.2 ([1]). Let r > s > \. If the collocation method (5.4) is sta- 
ble then for sufficiently large n and m we have the estimate (note that Wm 
actually depends on n via (5.4)) 

\\Pm{u - Wm)||H‘ < C (n*“® + m*"’’) |1/||h* for allt \\ <t < s. (5.7) 

Choosing m ^ a straightforward calculation shows that the error 
estimate is optimal, i.e. 

11^ - UnWw < C n'-^||/||H^ for alH > i : s-^<t<s. (5.8) 

Thus, this method is very suitable if we have sufficiently smooth coefficients 
c and d (being in H^) and if the image of K is also smooth enough (c H^). 
Note that the right-hand side / must not belong to but only to H^. 

Unfortunately, if we want to measure the approximation error u — Un 
in ’’negative” Sobolev norms, this method does not give any improvement 
of the convergence rate. The reason is that (5.7) can not be extended to 
t < 0, since we use a collocation type method for the correction of Zn- On 
the other hand. Lemma 5.1 has been extended to the case of negative t 
by the authors. Furthermore, in [4] the authors succeeded in correcting the 
collocation type solution by the solution of a certain small Galerkin scheme 
with regard to better convergence rates for t < 0. More precisely, the corrected 
collocation method consists in the evaluation of the higher Fourier coefficients 
of the approximate solution by the usual collocation method, whereas the 
lower Fourier coefficients are evaluated by the usual Galerkin method. Both 
methods are applied to the whole equation (3.5). 
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Corrected Collocation Method 

•— PfnyM H" Qm^n ^ Tn? (^*9) 

where € 7^ is the solution of the collocation scheme 

RnAZn = Rnf (5.10) 

2 md yM ^ Tm is the solution of the Galerkin scheme 

PM^yu — Pm!- (5-11) 

Here M is an integer lying between m and n (in practice one can choose 
M := 2m, for example). For details see [4]. If M < n then the costs 
for the Galerkin scheme (5.11) are defensible. Since l|<5m(^ - Zn)\\w < 
C \\u - ZnWh^ <Cm^ n~^ 1I/||h- Vt < 0, the higher Fourier coefficients 
of Un converge better than the collocation solution Zn itself. The following 
lemma is devoted to the convergence of the lower Fourier coefficients. Let K* 
denote the L‘^-adjoint to K. 

Lemma 5.3 ([4]). Let r > s > |, K* e and the Galerkin 

method be stable. Then for sufficiently large n and m we have the estimate 

\\Pm{u - 2/m)||h‘ <C {M - m)~'' II/IIh- for allt: -r <t < (5.12) 

Proof. The estimate (5.12) proves to be one of the crucial points of the cor- 
rected collocation methods. Therefore, we sketch main ideas of the proof, 
although it is given already in [4]. By the famous Kozak formula and 
Pm Pm = Pm we find that 

Pmi'U' - yM) = PmA~^f - Pm{PMAPM)~^ f 

= PmA~^QM{I -f (Qm^“^Qm)“^Qm^”^^m)/ 

It can be shown that the operator sequence I -f- {QmA~^Qm)~^QmA~^ Pm 
is uniformly bounded in H^. Furthermore, we have 

PmA~^ Q M = PmBQM PmT A~^Q M = Pm{Pc~^ P + Qd~^Q)Q M + PmT' Q M 

with T' € /C(H^,H^), Vg > -|. Let z; G H^. A little thought shows that the 
identity 

2 

\\PmPc-^PQMv\\ij.= Y. + E 

0<k<m/2 l>M/2 

holds. Using the Cauchy-Schwarz inequality and the relation k-l < ^{M - 
m) < 0 we obtain 

\\PmPc~^ PQmv\\w <C \\QM-mC~^\\hA\QMv\\H^ 

<C {M -m)~^ ||c”^||H^||'y||H^. 
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The same applies to \\PmQd 

Finally, we have \\PmT‘QM\\H^-^H* < ^ Thus, the estimate 

\\Pm^~^QM\\Yi^-^w < C (M - m)“^ (5.13) 

is proved and we are done. The last relation can be interpreted as some kind of 
localness of A~^ with respect to the family of the projections {Pm}m^N • Note 
that (5.13) is an essential ingredient of the corrected collocation methods, 
since we can not succeed without such a property. 

Choosing again m ^ ^/n and M = 2m, we arrive at 

I|m - Wnlln* < C {m*n~^ + {M - m)~^) H/Hh* 

< C ( n3~® + n~3 ) ||/||h» for alH : -r <t < -^.(5.14) 

One can show that if r > max{3s, 5 + |t|} then the Ictst expression for the 
convergence rate improve both the rate of the pure collocation type method 
(4.2) and (4.3), respectively, and the rate of the pure Galerkin method (5.11) 
in each negative Sobolev norm || • ||h*- 

Nevertheless, for solving the collocation scheme (5.10) we have still a com- 
plexity of O(n^) operations. The following algorithm unites the advantages of 
Amosov’s 0(nlogn)-algorithm and author’s corrected collocation method. 

Parametrix— Galerkin Method 

Ufi •= PmyM 4" Qm^n ^ 7n> (5.15) 

where Zn € 7^ is defined by 

Zn := RnBRnf (5.16) 

cnid Pm € Tm is the solution of the Galerkin scheme 

PmApm — Pnf- (^-17) 

Choosing again m ^/n and M = 2m, the Lemmata 5.1 and 5.3 lead to 

||w - Wnlln' < c 4- m‘“’’ + (M - 1 |/||h- 

< C ( + n“3 ) ||/||h' for all t : —r<t< —^,(5.18) 

i.e. the same rate as for the corrected collocation method. Thus, all con- 
clusions from above apply again. Moreover, it is clear that the complexity 
for the computation of Un by the parametrix-Galerkin method is O(nlogn), 
except for the calculation of the entries of the matrix Am and the right-hand 
side /m of (5.17). In order to reduce the complexity somewhat, one should 
make use of the fact that the main part of Am has a diagonal-circulant 
structure. 
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5.3 Error estimates for Amosov’s method in Holder- Zygmund 
spaces 

Now let the right-hand side of equation (3.5) be an element oiW, s > 0, 
and the operator A be in G£('H^). The uniquely determined solution will be 
denoted by u G Consider the method (5.2) - (5.4). 

Lemma 5.4. Let r > s > 0. Then for sufficiently large n and m we have the 
estimate 

WQmiu - Zn)\\n* < C (n‘“*(logn)^ +m*“’’+'*)logm ||/||h», 0 < i < s. 
Proof, (cf. [1]) We have 



Qm'^ Qm^n — Qm'^ Qm^n^^nf — Qm^'^ "h 

where T is the compact operator from Subsection 3.1 and hn := (/-i?n)5/-h 
B{I-Rn)f-^(Rn-I)B{I-Rn)f. 

Thus, WQmiu- Zn)\\n^ < C logm ||Tul|^r-i + C logm ||/in|k‘. From 

the Lemmata (3.2)(ii) and (4.1)(ii) we get that ||/in||?^‘<C'n^“^(logn)‘‘^||/|| 7 ^-. 
Finally, we use the smoothing property of T to get ||Tu|| 7 ^r-<j <C\\f\\H‘. 

Lemma 5.5. Let r > s > 0. Then for sufficiently large n and m we have the 
estimate (note that Wm actually depends on n via (5.4)) 

\\Pm(u - Wm)\\w < C (n‘“*(logn)^ (logm)^ ll/lk*, d<t<s. 

(5.19) 

Proof. We use the notations from that part of the proof of Theorem 4.1 
concerning the collocation method. Put Aq := cP + dQ. First of all, the 
equations Au = f and Rmi^o + A^m)^m = Rm9 sire equivalent to (M_ -f 
M^^T)u = M^^f and (M_ + RmM^^Tm)wm = Rm^^^g, respectively. 
The operators Tyn are defined by Tm .= T — K Km and prove to be in 
ICildf for all m. We have the identity 

(M_ + RmM^^Tm)Pm{u ~ Wm) 

= (M_ + RmM+^Tm)PmU - RmM^^ f + RmM+^AoQmZn 
= (M_ + RmM^^Tm)PmU ~ Rm(M- + M;^^T)u 
+RmM-^ ' (M + T — K)QmZn 
~ {Al—PmU RmAt—Pmt^) ~ Rm^—(.Qm'^ QmZn) 

+ RmM_^_^ (TmPmU ~ Tu) + RmM_^.^{Tm ~ Rm)QmZn 
— ~RmAI—{Qm’f^ ~ QmZ^n) ~ Rm^+ Pmi.QmV' ~ Qm^^n) 

+ RmM:;:^(Km - K)u - RmM+^KmQmZn, 




310 D. Berthold, B. Silbermann 



where we have used that M^PmU G 7^, RmM_^ ^ and - 

T - Km - K. Since KmQmZn{x) = Jq /cm(x,y)Qm^n(y)dy and km(xr) is 
orthogonal to Qm^n for all x G [0, 1), the last summand vanishes. Prom the 
proof of Theorem 4.1 we know that M_ -f RmM^^Tm is invertible in 
for all m > mo and that the sequence of the inverses is uniformly bounded. 
Therefore, we can estimate 

||^m(^ “ ^ C \\Rm{M— -|- M_|_ Tm)\\'H^^'H*\\Qm'^ ~~ Qm^nWH^ 

+C \\RmM^^ Rm{Km ~ K)u\\y^t. 

Proposition 8.13 in [21] yield that the estimate 

\\Rm{Km - K)u\\nt = \\Rm{Tm ~ T)u\\ut < C m^~^logm \\u\\n^ 
is valid and, thus, \\Rm{M^+M^^Tm)\\'H^^n* ^ C' logm. Now from Lemma 
5.4 the assertion follows. 

Choosing m a straightforward calculation shows that the error 

estimate is almost optimal, namely 

< C' n^“^(logn)^ II/IIt^^ for all t > 0 : s- <t<s. (5.20) 

5.4 A fast algorithm for the airfoil equation 

Now we are going to describe a fast algorithm (being based on the discrete 
cosine transformation) for the airfoil equation (3.9). Let := J-Pm-> where 
m is an integer less than n. One essential assumption for the algorithm is 
that the set of the collocation points has the property = 0, ...,m — 

1} C = 0, ...,n — 1}. This can be guaranteed without any difficulties 

in the cases when |o:| = |/ 3 | = por example, if a = | then 

(2n -h l)/(2m + 1) G Z ensures the above inclusion. Thus, we shall only 
consider equation (3.9) with a = 0. 

Obviously, the operator B := (Aq -f uW)~^ proves to be a parametrix to 
A, since I — BA := —BK is compact. An easily verifiable criterion for the 
invertibility of Aq + I'W can be found in [2], Thm. 3.7. In particular, if z/ G M 
then Ao H- uW is always invertible. 

Parametrix— Quadrature Method 

Ufi •— Pm^'tT^ Qm^n ^ ^n> (5.21) 

where Zn C Un is defined by 

:= (KiAo + uW)P^)-^RU (5.22) 

and Wm ^ 77m is the solution of the quadrature scheme 



R>;„iAo+uW + Km)wm = R!L9, 9-.^f-R!;,{Ao + t'W)Q^Zn. (5.23) 
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Let US compare (5.3) and (5.22). Both equations realize simplifications 
of the quadrature method directed to the construction of fast algorithms. 
However, in the present case we can get a ’’fast” method already by omitting 
the operator K (in fact, we get the collocation method for Aq-\-vW) because 
of the good mapping properties of Aq^-vW on polynomials, whereas the same 
procedure in the case of non-constant coefficients as in (5.3) still does not 
work. 

Now, let us describe the realization of the method on the computer. For 
details see [3]. For p = cr or p = /i let -,n-i be the ana- 

logue of the Fourier matrix of order n with respect to the system 
a so-called generalized Vandermonde matrix. Using the Gauss type quadra- 
ture rule J^v{x)p{x)dx « Yj^k=o One can show that the relation 

(G^)^D^G^ = In is valid, where = diag^^o,..,n-iPn,Aj. Thus, the inverse 
of G^ can be written as (G^)^jD^. For a function g we denote by the vec- 
tor, whose entries are the values of g at the points = 0, ...,n - 1, and 

by g^ the vector containing the Fourier coefficients {g^p^)p,k = 0,...,n — 1. 
If g e Hn, then g^ = G^g^. 

The mapping properties of Aq and W on polynomials lead to a represen- 
tation Hn of RI^{Ao + uW)P:^ in the pair of the bases , {Pk}k=o) 

being a tridiagonal matrix, see [2]. Note that solving an algebraic system of 
equations having a tridiagonal matrix can be done with 0{n) operations. We 
compute Zn as follows. 

(5.24) 

For the right-hand side of (5.23) we have 

= Im,nGWnZl, (5.25) 

where is the diagonal matrix with the entries = 1, if m < j = 

A: < n — 1, and = 0, otherwise, j,k = 0, ...,n - 1, and Im,n is the 

(non-quadratic) matrix [Im,n]ij = 1, if ^ = x^ -, and [/m,n]tj = 0, 
otherwise, i = 0, ..., m — 1; j = 0, ...,n — 1. 

Finally, we have to solve the quadrature scheme (5.23), which needs 0{mZ) 
operations. Using the discrete cosine transformation, see [25], we can do the 
computations (5.24) and (5.25) with an amount of 0(n log n) operations. 
Choosing m ^ y/n then the calculation of Un needs really only 0(n log n) 
operations. 



5.5 Error estimates 

Here we suppose that the right-hand side / of equation (3.9) is in s > ■!, 
and that the operator A is in G£(L^ ^,L^ ^). Further, let (4.4) be fulfilled. 
The uniquely determined solution is denoted by u G For detailed proofs 
of the following two lemmata see [3]. 




312 D. Berthold, B. Silbermann 



Lemma 5.6. Let r > 5 > |. Then for sufficiently large n and m we have 
the estimate 



WQmiu - ^ ^ ||/||^, ^, 0<t<S. 

Proof Use the decomposition Q^{u - Zn) = Q^{I - BA)u + Q^hn, where 
hn := B{f — Rtf^f) -f i^B{R^ — I)Wzn- Since the collocation method for 
(^0 H" ^W) is stable, the assertion follows immediately. 

Lemma 5.7. Let r > s > ^ and let n and m he such that (2n -f l)/(2m -h 1) 
is an integer. Then for sufficiently large n and m we have the estimate (note 
that Wfn actually depends on n via (5.23)) 

\\P^(u - Wm)\\hl^^ - ^ II/IIlj ^ for all t: ^ <t< s. 

Proof. We have the identity 

RI^,{Ao + iyW + Km)Pf;,{u-Wm) 

= Rff^iAo + I/W + Km)P^u - RU A + i^W)Ql,Zn 

= Rff^iKm - K)u - + vW)Q^^{u - Zn) - RffnKmQ^m^. 

Due to an orthogonality argument, the latst summand vanishes. Using the 
stability of the quadrature method, after a straightforward calculation we 
arrive at the assertion. 

Choosing m ^ we arrive at the optimal error estimate 
Ih - '^nWhi ^ < C ||/||l 2 ^ for all t > ^ : s- ^ <t<s, (5.26) 

which is completely analogous to (5.8). 



6 Concluding remarks 

1^ Spline approximation methods: It is also possible to construct fast algo- 
rithms for solving the equation (3.5) if we seek the approximate solution in 
form of a spline function. In [5] there is given the analogue of the parametrix- 
Galerkin method (see Subsection 5.2 of the present paper) in the spline case. 
This algorithm can also be realized with an amount of only 0(n log n) oper- 
ations and it possesses a better convergence behaviour than the spline collo- 
cation method in ” negative” Sobolev norms. 

2^ Multigrid methods: In the previous section we always have supposed 
that the small schemes of the fast algorithms are solved by a direct method 
with an amount of O(m^) operation. However, there are a couple of papers 
dealing with multigrid methods for equation (3.5), see for example [11, 12, 22], 
and for equation (3.9), see the paper of the authors [2]. In both cases we can 
reduce the computational complexity to O(m^) if we use multigrid methods. 
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Therefore, we have still an amount of 0{m^ -h nlogn) operations for the 
realization of the fast algorithms. If we choose n and m such that m ^ 
then the total amount of computational work for calculating the approximate 
solution Un is again O (nlogn). However, the error estimates (5.8) and (5.26) 
become optimal even for alH > | : s — {r — s) < t < s. 

3® Systems of equations: The results of the present paper can be extended 
to systems of pseudodifferential equations without any difficulty. 

4^ Multidimensional case: In the present paper we have only dealt with 
one-dimensional pseudodifferential equations. However, the ideas leading to 
the corrected collocation method and to the parametrix-Galerkin method 
are not restricted to the one-dimensional case. It is possible to establish 
these methods also in the multidimensional case, if the underlying ’’classical” 
approximation methods are proved to be convergent and if the operator A~^ 
possesses a property like (5.13). 



7 Numerical results 

We have applied the fast algorithms from the Subsections 5.1 and 5.2 to a 
Cauchy singular integral equation on the unit circle T. Using the parametric 
representation 7 : E -> T, y such an equation has the form 

Au:={aI-\-bS)u = f, (7.1) 

where S is the Cauchy singular integral operator (3.4) and a,b,u, and / are 
1-periodic functions. 

In the following table we give the results for equation (7.1) with a{x) = 
2 + I sin 27tx\^ and b{x) = 1, such that the functions c = a + 6 and d — a - b 
are (invertible) elements of H^, Vr < 11/2. The right-hand side / is chosen 
in such a way that (7.1) possesses the solution u{x) = cos 27 tx -h i * | sin27rx| 
being an element of H^, Vs < 3/2. 

We have done our computations on a Pentium PC (90 MHz) using a 
TURBO-Pascal 7.0 program code. In Table 7.1, there is written down the 
approximation error in the H^-norm (t = 1, 0, — 1, —2, —3) of the parametrix- 
quadrature method (i.e. Amosov’s method) and of the parametrix-Galerkin 
method. Furthermore, we give the results for the pure Galerkin and the pure 
collocation method in order to compare with (only for small n). Note that for 
equation (7.1) the quadrature method coincide with the collocation method. 
All algebraic systems of equations has been solved by the Gaussian algorithm. 

The column of the table contains the cpu-time needed (in sec- 

onds) for calculating the approximate solution, including the time needed 
for establishing the matrices but without the time needed for computing the 
right-hand sides. Note that we have defined the collocation points here by 
= (j + !)/«, 3 = 0, instead of =j/( 2n + l), j - 

as in Section 4. For each of the methods, the numbers C and a characterize 
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the convergence rate ||u — Un\\w ~ C n~^. These two numbers were deter- 
mined by the least square method from the sequence of the computed errors 
\W - '^nWw the corresponding integers n. According to (5.8) we can ex- 
pect a rate of for 0.5 < t < 1.5 in case of Amosov’s method. For the 

parametrix- Galerkin method the theoretical convergence rate in the present 
case is for — 1 < t < —0.5 and for —5.5 <t< —1, see 

(5.18). The numbers ath in Table 7.1 refer to these theoretical convergence 
rates. 



Table 7.1 : Errors and cpu - time 



1 ^ 


KM 




cpu 


1 * = i 


t = 0 


t = -i 


t = -2 


t = -3 


1 Amosovas 


Method 












80 




16 




0.13417316 


0.00337379 


0.00029899 


0.00014776 


0.00012408 


216 




24 


1.93 


0.08027192 


0.00072422 


0.00003509 


0.00001974 


0.00001696 


480 




32 


5.88 


0.05356986 


0.00021505 


0.00000675 


0.00000399 


0.00000343 






40 


49.92 


0.03703535 


0.00007145 


0.00000154 


0.00000092 


0.00000079 


1584 




48 


83.92 


0.02940520 


0.00003583 


0.00000061 


0.00000037 


0.00000032 










1.2 




2.5 


1.0 


0.8 




a 




-0.51 




-2.07 


-2.01 


-2.00 




°Lth 




-0.50 




-1.00 


-1.00 


-1.00 


Parametrix — Galerkin Method | 


80 


16 


8 




0.14510365 


0.01061950 


0.00191730 


0.00037883 


0.00007957 




24 


12 




0.08074067 


0.00125040 


0.00012795 


0.00001675 


0.00000234 




32 


16 


4.12 


0.05413013 


0.00085010 


0.00008826 


0.00000959 


0.00000106 


1000 


40 


20 


EiSSil 


0.03704252 


0.00009239 


0.00000484 


0.00000040 


0.00000003 


1584 


48 


24 




0.02940949 


0,00005166 


0.00000279 


0.00000021 


0.00000002 






7 






23.4 




19.6 


17.8 






y. 






-1.76 






-2.84 

















-1.83 


Collocation Method | 


16 




mu 




0.33612030 


0.04038787 


0.00814864 


0.00394639 


0.00320271 


24 


■ 


m 


0.28 


0.26115328 


0.02094460 


0.00333428 


0.00169198 


0.00141092 


32 


■ 


m 


0.66 


0.22086943 


0.01329585 


0.00179492 


0.00093631 


0.00078912 


40 


■ 


m 


1.49 


0.19482968 


0.00938736 


0.00111737 


0.00059414 


0.00050354 


48 


■ 


m 


3.13 


0.17624279 


0.00707894 


0.00076116 


0.00041045 


0.00034903 


64 


■ 


m 


7.69 


0.15092927 


0.00454864 


0.00041748 


0.00022951 


0.00019591 


80 


■ 


m 


17.75 


0.13410252 


0.00323407 


0.00026295 


0.00014640 


0.00012523 


96 


■ 


m 


41.58 


0.12188266 


0.00244991 


0.00018059 


0.00010146 


0.00008690 


112 


■ 


■ 


99.20 


0.11249070 


0.00193835 


0.00013159 


0.00007444 


0.00006381 




C 




1.5 


3.0 


■BOH 


1.1 


0.8 




a 






-1.56 




-2.04 


-2.01 




<^th 






-1.50 




-1.50 


-1.50 



Galerkin Method 



16 




■ 




0.22762698 


0.01693653 


0.00171368 


0.00019165 


0.00002466 




m 


m 




0.18473136 


0.00901615 


0.00059466 


0-00004309 


0.00000328 




m 


m 


0.61 


0.15964009 


0.00580911 


0.00028484 


0.00001530 


0.00000086 


40 


m 


m 


1.38 


0.14264252 


0.00414076 


0.00016173 


0.00000691 


0.00000031 


48 


m 


m 




0.13014137 


0.00314329 


0.00010206 


0.00000362 


0.00000013 


64 


m 


m 




0.11264102 


0.00203718 


0.00004949 


0.00000131 


0.00000004 


80 


■ 


m 


17.19 


0.10072114 


0.00145616 


0.00002827 


0.00000060 


0.00000001 


96 




m 


40.54 


0.09193086 


0.00110708 


0.00001790 


0.00000032 


5.8e - 9 


112 


B 


■ 


97.06 


0.08510296 


0.00087821 


0.00001216 


0.00000018 


2.9e - 9 








0.92 


1.12 


^■ngn|||H 


3.58 


8.03 




y 




-0.50 


-1.52 




-3.56 


-4.62 




a 


th 




-0.50 


-1.50 




-3.50 


-4.50 
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Different Methodologies for Coupled BEM 
and FEM with Implementation on Parallel 
Computers 

U. Brink, M. Kreienmeyer, and E. Stein 



1 Introduction 

The boundary element method (BEM) and the finite element method (FEM) 
may be computationally expensive if complex problems are to be solved, thus 
there is the need of implementing them on fast computer architectures, es- 
pecially parallel computers. Because BEM and FEM are complementary to 
each other, the coupling of these methods is widely used. In this paper, a par- 
allel FE-BE-code and its implementation on a distributed memory system 
(Parsytec MultiCluster2) is described. As model problems, we assume linear 
elasticity for the boundary element method and elastoplasticity for the finite 
element method. The efficiency of our implementation is shown by various 
test examples. Two coupling methods are proposed. First, we show by nu- 
merical examples that a multiplicative Schwarz method for coupling BEM 
with FEM is very well suited for parallel implementation. Furthermore, we 
treat a symmetric coupling of Galerkin-BEM and mixed finite elements of 
Raviart-Thomas type leading to improved stress approximation. 

This paper treats different methodologies of the coupling of FEM and 
BEM and the parallel implementation on a distributed memory system, the 
Parsytec MultiCluster2. Most parallel computers can be divided into two 
categories: shared memory and distributed memory computers. Shared mem- 
ory systems typically have few very powerful processors, whereas distributed 
memory computers (like Intel Hyper cube, nCube, Power Xplorer) normally 
consist of a large number of moderate performance processors with local 
memory which requires communication between the processors, achieved by 
message passing. The efficient implementation of numerical algorithms on 
these two architectures differs substancially. On shared memory computers, 
it is analogous to the implementation on vector computers, whilst distributed 
memory architectures imply a finer grain parallelism with a larger number of 
communication steps. 

In particular, the parallelization is performed by global data decomposition 
which normally leads to finer grain algorithms compared with the well-known 
domain decomposition method. The same algorithm as on a sequential com- 
puter is implemented, and data have to be distributed in such a way that the 
communication and synchronization costs are minimized. 
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The outline of this paper is as follows: In Section 2, we briefly describe the 
model problem (linear elasticity and elastoplasticity for BEM and FEM, re- 
spectively) and its discretization by FEM and BEM. In Section 3, the par- 
allel implementation of the two methods is treated, including some results 
from various test examples. Solving the linear systems of equations causes 
the highest communication costs and is described in detail. Section 4 con- 
tains the BEM-FEM coupling by a multiplicative Schwarz method and a 
symmetric coupling of Galerkin BEM with mixed finite elements of the 
Raviart-Thomas type. 



2 The model problem 



2.1 Linear elasticity and rate-independent elastoplasticity 

The linearized strain e = |(Vn -f- (Vix)^) is decomposed into an elastic part 
and a plastic part e^, 

= e - 6^^. (2.1) 

In the frame of linear elasticity, a free energy function is postulated in the 
form 

^ = :C:e^ , ( 2 . 2 ) 

with the elastic strains and the symmetric and positive definite elastic 
tensor C. Then the stresses follow as 



cr = de^^{e^) • 



(2.3) 



As a model problem, we treat the classical elastoplastic von Mises model 
(see e. g. [16]) with the yield function 



^((T, a) =11 dev a || - 




(2.4) 



where K{a) denotes the isotropic hardening function. For linear hardening 
we have 

K{a) =yo + Ha, (2.5) 

with the hardening modulus H > 0 and the initial yield stress 2/o > 0- The 
associated flow rule is described by 



eP = Xdcr^(cr,a) 



( 2 . 6 ) 



and the evolution of the internal variable a by 

« = (2-7) 

The loading-unloading conditions can be expressed in the Kuhn-Tucker form 
as 

A > 0, ^(cr, a) < 0, X^(cr, a) = 0 . (2.8) 

The solution of the coupled elastoplastic problem is obtained by a standard 
return mapping algorithm [6, 22]. 
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2.2 Discretization with boundary elements 



In the boundary element domain Qb linear elasticity is assumed. We start 
from Kelvin’s fundamental solution which for plane strain reads 






r.ir 






(2.9) 



with 



ri = yi- Xi, r = ^/{ri,ri), r^i = — . 

r 



G is the shear modulus. The displacements at any interior point of are 
given by the Betti formula 

u(x) = - [ G{x,y)t{y)ds{y)+ j {TyG{x,yyf u{y)ds{y) (2.10) 

JTb JTb 

where Ty denotes the traction operator, and t{y) = Tyu{y) are the tractions 
corresponding to the displacements tx at a point y. Taking the limit x Fb 
we obtain the first boundary integral equation 



= -Vt + Ku 

2 



Here, 



{Vt){x)= [ G{x,y)t{y)ds{y), 

JTb 

is a weakly singular operator and 

(Ku)(x)= f {TyG{x,y)fu{y)ds(y), 
Jrr, 



( 2 . 11 ) 

( 2 . 12 ) 

(2.13) 



is Cauchy-singular. Applying the traction operator Tx yields the second 
boundary integral equation 



It = -K't - Wu 



(2.14) 



where K' is the adjoint of K and 

{Wu){x) = -T^[ iTyG{x,y)fu{y)dsiy), (2.15) 

JTb 

is a hypersingular operator. There are different numerical methods, namely 
collocation, Bubnov- Galerkin, Nystrom and least-square method. With 
the collocation method, using the first integral equation 2.11 one obtains the 
following linear system of equations 



Hu = Gt 



(2.16) 
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where u and t contain the unknown and the known displacements and trac- 
tions in the collocation points. Reordering yields 

Ax = b (2.17) 

with a nonsymmetric and dense matrix A. Using both integral equations 
(2.11) and (2.14) the Galerkin method leads to a S 3 nnmetric but in general 
indefinite system matrix A. For this reason, standard iterative solvers for 
symmetric positive definite matrices may not work. In [12, 14, 18], some 
iterative solvers are tested. It turns out that for the collocation method Bi- 
CGSTAB [25] and GMRES [19] are favourable. 

In the following, we compare collocation and Bubnov-Galerkin method 
with respect to accuracy. The regular integrals in the collocation code are 
computed with a GAUSSian rule, the singular integrals analytically. For the 
Galerkin BEM, we use an implementation [17] where the inner integration 
is performed analytically whereas for the outer integrals a GAUSSian rule is 
used. The non-regular integrals are evaluated analytically. 

As a test example, we consider a circular cavity with radius 1 cm in an 
infinite elastic medium under horizontal tension of magnitude 1^^. The 
problem is analyzed under plane strain conditions, the material parameters 
are Young’s modulus E = 1-^ and Poisson’s ratio i/ = 0.2. Linear trial 
functions are used for the unknown displacements. In Table 2.1, the computed 
horizontal stresses ax,h from the collocation and the Galerkin method at 
the point (0, 1.02) are given for different numbers of regulary distributed 
elements. In Figure 2.1, the geometry of the problem is given. 




Fig. 2.1. Geometry and coarse discretization of circular cavity problem 

n, n' denote the number of elements on two different discretization levels, 
and Kn = Order of convergence with n' < n and En = 

Wx - The Galerkin method yields higher accuracy and higher order 
of convergence (exact solution ax = 2.8663525^^). 




Parallel Coupling of BEM and FEM 



321 



Table 2.1. (Tx,h at the point (0, 1.02) 





1 Galerkin | 


1 Collocation 


n 




Kn 




Kn 


8 


2.7877299 




3.3673909 




16 


2.8632967 


4.69 


3.0131097 


1.77 


32 


2.8655840 


1.99 


2.904475 


1.94 


64 


2.8659372 


0.89 


2.8781045 


1.70 


128 


2.8662519 


2.05 


2.8687889 


2.10 


256 


2.8663456 


3.84 


2.8669831 


2.03 


512 


2.8663524 


4.88 


2.8665075 


2.02 



2.3 Discretization with finite elements 



The displacement-based finite element method is a Galerkin method or 
mechanically the principle of virtual work which for the discretized domain 
without body forces reads 



n « ^ r 

52 / <^h{uh) : ehivh) dQ-Y\ I ihVh dA 

i=l i=l 



(2.18) 



with 

Uh,Vh e Vh = 0on Fu, n/, = w on (2.19) 

ih C L2(ri)^, (2.20) 

and fii is the element domain, n denotes the total number of elements, The 
trial displacements Uh within the element i are approximated by linear shape 
functions iV, written in condensed matrix form as 



Uh = iVw* in fii , (2.21) 

where it* are the nodal displacements. In the same way, the test functions Vh 
are represented. This leads to element stiffness matrices fe* and element force 
vectors /*, which are assembled to the global algebraic system 



Ku = f , (2.22) 

where K is symmetric, positive definite and sparse. This system can be solved 
by various direct and iterative solvers. 

In the case of elaistoplastic materials a linearization is necessary if 
NEWTON-type methods are used. Then, in each time step i (or load step 
i ) the linear system 

(2.23) 

has to be solved. Kriuj^i) and il(uj_^i) are the tangent stiffness matrix 
and the residual force vector at the configuration defined by Uj^^- Here, 
subscripts refer to the load step and superscripts to iteration within the load 
step. 
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3 Parallel implementation 

On distributed memory computers, each processor has access to its own local 
memory only. Data exchange between different processors is done by message 
passing, and the time needed for this interprocessor communication is to be 
considered. For parallelization, data are distributed onto the processors in 
such a way that communication costs should be as low as possible. One 
approach is the domain decomposition method which yields high efficiency if 
it is possible to divide the domain into subdomains of nearly equal amount of 
work. In the case of non-linearities or adaptive mesh refinement, this can be 
extremely complicated. In this paper, we discuss the parallelization by data 
decomposition. It yields an algorithm with very good load-balancing and high 
efficiency as shown by the examples in this section. 

3.1 Parallel implementation of the BEM-code 

The parallel program starts by reading the output file of a preprocessor, and 
then data are distributed amongst the processors. The parallel BEM-code 
consists of three parts: 

- assembling of the system matrix and the right-rand side 

- solving the linear system of equations 

- computing displacements and stresses at interior points 

The parallel implementation of the first and the last part can be done very 
simply in different ways, but solving the system of equations causes high com- 
munication costs. Due to this fact, the data distribution remains unchanged 
during all steps of computation and depends on the implementation of the 
solver. 

3.1.1 Parallel assembling of the system matrix 

For the boundary element method, only a discretization of the boundary 
is needed. This is a small amount of data in comparison to domain discretiz- 
ing algorithms like the finite element method. For 2D-problems it is possible 
to store all data needed for the description of the problem (geometry, bound- 
ary conditions, elements, nodes etc.) in each local memory. As a result, each 
processor can now compute any element of the system matrix or the right- 
hand side independently of the other processors, i.e. without communication. 
In the case of linear elements in 2D linear elasticity we have to compute 2x4 
element matrices. Some of the computations can be done for all 8 entries 
of the element matrix. If the columns of A are distributed these computa- 
tions have to be done four times, in case of a row-wise distribution only 
two times. Another advantage of the row-oriented version is that it is possi- 
ble to compute the strongly singular integrals, which occur in the H-matrix 
of the collocation method by a row summation without communication. In 
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comparison to the row-oriented assembling, the column-oriented version is 
about 1.3 times slower. So, each processor has to perform n/p rows of the 
system matrix and n/p elements of the right-hand side vector which gives a 
well-balanced workload. 

If a direct solver is applied, a wrap-around-distribution is used. In the case 
of an iterative solution, a blockwise storage of the matrix A is performed, see 





a) b) 

Fig. 3.1. Storage scheme a) wrap-around-distribution b) blockwise distribution 



3.1.2 Computing displacements and stresses at interior points 

After solving the system, the computed data, i.e. tractions and displacements 
on the boundary, are distributed on the local memories of the processors. 
Then, each processor is able to compute displacements and stresses at any 
given interior point. If we want to compute results at m points with p proces- 
sors, each has to perform m/p points, and a good load-balancing is achieved. 
The computed results on the boundary and at interior points are collected on 
one processor which sends these data to the host computer where the post- 
processing is done. The collection and broadcasting of data is implemented 
in a tree of processors. 

3.1.3 Parallel implementation of the solver 

We have to solve a linear system with a full and nonsymmetric or symmetric 
but indefinite system matrix. This can be done by a direct method, i.e. LU- 
decomposition. If we employ p processors for the solution of a linear system 
with n equations, each processor receives n/p rows and n/p elements of each 
vector in its local memory. These rows of A are distributed in a cyclic manner 
via the so-called wrap-around-map so that an equally balanced workload can 
be achieved. This distribution is not done via communication but the rows 
are computed locally as needed for the LU-decomposition, i. e., processor 
computes rows with index j with j modp = i. For details, see [I, 9]. 

For the parallel implementation of an iterative solver on p processors, each 
processor receives a block of n/p rows of the matrix A in its local memory, i. e. , 
processor computes rows with index j with (nip) -i < j < {n/p) • (z -h I). 
The most time consuming part of an iterative solver is the matrix vector 
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product. This is performed in a ring of processors. In a C-like pseudo code, 
the product A q = v reads for each processor 

v_local = A_local*q_local; 
tmp^local = q^local; 

for(k=0; k<Number_of ^Processors ; k++){ 

Send tmp_local to the right neighbor; 

Receive tmpl.local from the left neighbor; 

Compute v_local+=A_local*tmpl_local ; 
tmp^local = tmpl_local; 

} 

The communicational amount is of order 0{n), the arithmetical costs are of 
order 0(n^). 

3.1.4 Performance results 



In this section, some results on the parallel efficiency of the BE-code 
(collocation method) are given. To compute the parallel efficiency Ep 






Speedup 

Number of processors 



we use the definition for the parallel speedup Sp 



^ _ Time on one processor 
^ Time on p processors 

As a test example, the problem described in Figure 3.2 is discretized with 
a varying number of elements. The material parameters are E = 21000^^, 
1 / = 0.1, and the internal pressure is pr = 750—^. 



I 1 




Fig. 3.2. Bore hole with two radial cracks (plane strain) 
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In Table 3.1 we show some results of the performance of different iterative 
solvers and the LU-decomposition. 

Table 3.1. Performance results for direct (LU) and iterative (Bi-CGSTAB, 
GMRES) solvers, varying granularity, 32 processors 



p=32 


n=400 


o 

o 

oo 

II 

i=! 


n=1600 


n=2000 


Ep(LU) 


0.40 


0.59 


0.80 


0.90 


Ep (Bi-CGSTAB) 


0.54 


0.72 


0.91 


0.93 


Ep(GMRES) 


0.51 


0.71 


0.92 


0.95 



These results show that a considerable efficiency in parallel computing 
can be obtained also for systems of moderate size, i.e. fine granularity. For 
larger systems, the iterative solver becomes superior. In Table 3.2, we give 
results for different numbers of processors for assembling the system matrix 
and the right-hand side. 



Table 3.2. Speedup and efficiency for assembling the system 



P 


1 


3 


4 


5 


8 


12 


16 


32 


Speedup 


1 


2.94 


3.95 


4.95 


7.8 


11.65 


15.6 


31.2 


Efficiency 


1.00 


0.98 


0.99 


0.99 


0.98 


0.97 


0.98 


0.98 



The parallel efficiency and the speedup for the calculation of stresses and 
displacements at interior points are given in Table 3.3. 



Table 3.3. Speedup and efficiency for calculation at interior points 



P 


1 


3 


4 


5 


8 


12 


16 


32 


Speedup 


1 


2.9 


3.8 


4.75 


7.7 


11.3 


15.05 


30.1 


Efficiency 


1.00 


0.97 


0.95 


0.95 


0.96 


0.94 


0.94 


0.94 



The performance of a parallel algorithm depends on the ratio of arith- 
metic work and the amount of communication, i.e. the granularity of the 
problem. Assembling the system matrix, computing the right-hand side and 
the computation at internal points imply very low communication costs. But 
solving the system by LU-decomposition or iterative solvers requires a finer 
granularity for this part of the code. The parallel efficiency of assembling the 
system and of the evaluation of stresses and displacements at internal points 
does not depend ais strongly on the granularity as it is the case for solving the 
system. For the latter, the efficiency slows down if more and more processors 
are used for a problem of constant size. 



3.2 Parallelization of FEM 

A displacement-based finite element program consists of the following parts: 

— generation of element stiffness matrices and force vectors, 

— assembling and solution of the global algebraic system, 

— calculations of strains and stresses within each element. 
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In nonlinear analysis, these modules interact in an iterative manner. The 
parallelization of FEM is done similarly to the BEM implementation, i.e. by 
data distribution [10, 11]. Since FEM is a domain-discretizing method, the 
amount of data for describing the problem is much larger than for BEM. 
Each processor can receive only a part of the whole data in its local memory. 
Starting with a coarse mesh, macro elements (i.e. problem data concerning 
element patches) are distributed on p processors and then refined regularly 
so that a hierarchical mesh is obtained. Then each processor computes a local 
stiffness matrix for these elements. This can be done in parallel without any 
communication, and each processor assembles a local matrix K^. The global 
stiffness matrix can be written as 

p 

K = Y,BjKiBi, (3.1) 

i=l 

where jB^ is a Boolean matrix but K is not assembled. In comparison with 
the domain decomposition method [8], each processor receives more than one 
macro element (see Figure 3.3) which results in a finer grain algorithm with 
higher communicational amount. But in case of local nonlinearities which oc- 
cur in elastoplaistic FE analysis or adaptive mesh refinement, a better load- 
balancing is achieved. The system is solved with a parallel preconditioned 





Fig. 3.3. Distribution of 256 elements on 4 processors, a) 1 macro element b) 4 
macro elements on each processor 



conjugate gradient method. As a preconditioner we use the concept of hier- 
archical bases [26, 27]. Therefore no additional communication (in comparison 
with the unpreconditioned eg) is needed for this method. 

3.2.1 Parallel implementation of the peg— solver 

In what follows, the parallel implementation of the conjugate gradient method 
is described, for details see [15]. Communication is only needed for nodes at 
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the vertices of macro elements and for computing global scalar products. 
Applying hierarchical bases as preconditioning method, a hierarchical mesh 
as in the multigrid method is required. The refinement is done locally on each 
processor. Then, if Vj is the standard FE nodal basis on the refinement level 
j, the hierarchical basis Sj is constructed by 

^3 — ® ^3 » ( 3 * 2 ) 

where the functions in Vj vanish at the nodes associated with 5j_i, and the 
coarse grid space satisfies 

<^0 = Vo . (3.3) 

This leads to a transformed system matrix 

Km = S^KS , (3.4) 

where S and are basis transformation matrices. Due to the low condition 
number of namely 

k{Km) = 0{\ In h 1'^) (3.5) 

and 

k{Km) = k{S^KS) = k(S'^SK) , (3.6) 

it follows that C~^ = S^S is a very efficient preconditioning matrix. 

During mesh refinement, each processor stores the information concern- 
ing the generated nodes and elements. With this information at hand, the 
required matrix multiplications Sy, S^r can be performed very efficiently. 
In parallel, one computes locally on each processor 

Vi = Sjri. (3.7) 

For the transformations, one proceeds from the finest to the coarsest mesh. 
Because all refinement steps have been performed locally on each processor 
all data are available to compute this step without communication. 
Only on the coarsest grid, there are nodes belonging to more than one macro 
element, i.e. to more than one processor. Now, the additive vector Vi has to 
be transferred to an overlapping one by 

y^j^Bjy,. (3.8) 

i=l 

After this, the back transformation from the coarsest to the finest mesh can 
be computed with 

Wi = Siifi with y = Biy . (3.9) 

Of course summations only have to be performed between processors which 

have common macro element vertices. For regular meshes it is easy to map 
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neighboring elements on neighboring processors, so that no global commu- 
nication has to be performed. The summation of the global scalar product 
can be implemented in a very efficient way in a tree of processors. For more 
complex meshes (e.g. due to adaptive mesh refinement), the mapping can 
be performed by recursive bisection algorithms (see [23]). For the parallel 
implementation of an adaptive finite element code with hierarchical precon- 
ditioning, see [11]. 

3.3 Performance results 

For the kind of data distribution described in 3.2, it is important to analyze 
how the granularity of the distribution affects the efficiency of assembling 
the local stiffness matrices and the solver. Working with more and more 
macro elements on each processor leads to a better load-balancing but to 
less efficiency for the peg because the amount of communication increases. 
Furthermore, there will be a loss of numerical efficiency, since for finer starting 
meshes, the peg needs more iteration steps. 

As a test example, we choose Cook’s membrane problem (see Figure 3.4) 
with different numbers of macro elements on each processor. 




Fig. 3.4. Cook’s membrane problem 

The coarsest mesh is given in Figure 3.3. For this plane strain problem 
with VON Mises plasticity and linear isotropic hardening. The material pa- 
rameters are 

- Young’s Modulus E = 70^^, Poisson’s ratio u = 0.33333 
-H = m.O^,yo = OA5^. 

On the finest mesh, we have 65536 elements, i.e. 131584 unknowns, and 32 
processors are used. In the case of elastoplasticity, the best distribution of 
macro elements depends very strongly on the element type. If the evaluation 
of the element stiffness matrix is cheap, a worse load-balancing during the 
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assembly has a smaller eflFect on the overall efficiency of the algorithm than 
the loss in efficiency in the pcg-solver. But if higher-order elements with 
highly non-linear material laws are used, the load-balancing becomes more 
and more important. 



Table 3.4. Efficiency for assembling the stiffness matrix (32 processors) 



time step 


14 


15 


16 


17 


18 


19 


20 


1 macro element 


0.98 


0.93 


0.89 


0.87 


0.85 


0.84 


0.80 


4 macro elements 


0.99 


0.96 


0.93 


0.92 


0.92 


0.94 


0.95 


16 macro elements 


0.99 


0.97 


0.96 


0.97 


0.98 


0.96 


0.96 



Table 3.5. Efficiency and Speedup for solving the system (32 processors) 



macro elements 


Efficiency 


Speedup 


1 


0.90 


28.8 


4 


0.85 


27.4 


16 


0.80 


25.6 



4 Coupling of BEM and FEM 

4.1 Multiplicative Schwarz method in two overlapping domains 

As a coupling algorithm, we use the multiplicative Schwarz method in two 
overlapping domains [21]. See Figure 4.1 for the notations. denotes the 
FE-domain, i ?2 denotes the BE-domain. Then, the iteration scheme reads 

Fi = dOi 
A,c = A n Q2 
r-2,c = A n i?i 
Q = U 

Fig. 4.1. Overlapping domain decomposition 
0 

iiTtii ='=/i 
with on Fi^c 

A = b 

from 



Start with Ui^ = 
Solve 




Solve 
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with 



= u 



l,c 



on T2 ,c 



Compute on Fi^c 

If the displacements have to be computed at k interior points, G and H 
are 2k x 2n matrices resulting from the discretization of Betti’s formula 
(2.10). The advantage of this iteration scheme compared with other coupling 
methods is that for each subdomain optimal solvers can be implemented, and 
the parallel BEM- and FEM-codes can be used without changes. The system 
matrix A remains the same during the nonlinear computation. Note that the 
evaluation of the matrices G and H is performed only in the first iteration 
step. 

In the following, we assume linear elastic behavior in i ?2 and an elastoplastic 
material in i?i. The nonlinear problem is solved by Newton’s method. In 
each step of the Newton algorithm, one iteration step of the multiplicative 
Schwarz method is performed. The iteration scheme then reads: 



Solve Kt Au\ — f = 0 
with u\ 



Solve Ax 2 = b with u 





= u\ ^ 




\ul ^ 


A,c 




A,c V ' 


a u 


II 

0 


A,c 



= 0 if i = I 



(4.1) 

As a test example the elastoplastic behavior of a square block compressed 
by two opposite perfectly rigid punches is studied. The problem is analyzed 
under plane strain conditions and the material is assumed to be perfectly 
plastic according to Section 2.1. The problem data are 

- Young’s modulus E = 6890“^, Poisson’s ratio i/ = 0.33, 

- yield stress yo = 0.89^, 

- hardening parameter H = 0, 

- pressure py = 25^, 

- 10 load steps. 



In Figure 4.2, a coarse discretization is shown. For the parallel computation, 
5184 finite elements, 316 boundary elements and 215 interior points are used. 
In Figure 4.3, the plastic zone after 10 load steps is shown. 

For the computing times on different numbers of processors, see Table 4.1. 

Table 4.1. Computing time in seconds, varying processor numbers p 



P 


8 


16 


32 


Time 


197.7 


116.3 


82.5 


Efficiency 


0.90 


0.77 


0.54 



In comparison to the pure BE- and FE-computations, the parallel effi- 
ciency is now lower due to the fact that data exchanges are needed and the 
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Fig. 4.2. Geometry and coarse discretization for rough punch problem, FE 
discretization in i?i , BE discretization on A = dQ '2 



in U 




n e lO 20 30 



Fig. 4.3. Total spread of plastic zone 



BE-system is solved only in the first iteration step. For further load steps, 
only a back- and forward substitution is performed which requires less CPU- 
time but is of lower parallel efficiency. 

In [12], numerical experiments concerning different sizes of overlaps are 
reported. Larger overlapping implies faster convergence of the multiplicative 
Schwarz method but higher computational amount in each iteration step, 
too. 



4.2 Symmetric coupling of Galerkin-BEM with mixed finite 
elements 

In comparison to standard FEM, one of the main benefits of boundary ele- 
ments is the high accuracy in stress computations. An alternative approach 
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is to use mixed finite elements with stresses rather than displacements as 
primary unknowns. By these means one may also overcome locking when 
Poisson’s ratio v approaches 0.5. 

For simplicity, we consider the linear elasticity problem 

div(7 = -f in i? 

cr — Ce{u) = 0 in 

tx = 0 on 

crn = 0 on Tt 

on the domain i? = fip U F* U Hb in d = 2 or d = 3 (see Figure 4.4). 



> (4.2) 



Fu 



Qf 

FBM 



Qb 

BBM n 



n 



n 



Fig. 4.4. Notation 



We assume that the body load / € L 2 (i?)^ vanishes on for simplicity 
and that the Lame coeSicients are constant on 

The mixed FEM is based on an extended complementary Hellinger- 
Reissner principle. Thus the displacement u and the stress cr are independent 
unknowns. Moreover, the stress tensor a is not a priori assumed to be sym- 
metric. Symmetry will be enforced in a weak sense by a Lagrange multiplier. 
The second equation in (4.2) is weighted by test stresses r with rn = 0 on 
Pt: 

I r:C~^crdf2— j r:e{u)df2 = 0. (4.3) 

J fijr J f2p 

Integration by parts yields 




' 7-1 



a dfi + div T •udQ + / r :ydf2 = 

Jof Jof Jr 



rn • u ds 



(4.4) 



with a third unknown 

representing rotations. The variational formulation in the FEM domain Op 
is given by 





Parallel Coupling of BEM and FEM 333 



Vd/2 + /„^divr-Md/2 + J^^r-.-ydO 




= Jp rn • (f) ds 


Vr e Hdiv 


/fiFdivo--vdf2 = -J^^f-vdn 


Vu G £ 


f^^cr-.Tjdn = 0 


Vt 7G W 




(4.5) 


with the unknowns 




cr e Tidiv ■— {t £ ; divT e L2{QfY, tti 


= 0 on Ft} 


ueC.= L2 {QfY 




7 e W := {t 7 G L2{QfY"'‘^ : = 0}, 





and (f) denoting displacements on F. Udiv is a Hilbert space with the norm 

Our objective is to establish a canonical coupling with Galerkin BEM. 
Since the displacements u are not in we cannot insert the traces 

into the boundary integral equations as it is done in the classical symmetric 
coupling, see Costabel [5]. The condition a G 'Hdi\ implies that the tractions 
an on F lie in H~^/‘^{F) and can be inserted into the boundary integral 
equations. In detail, we proceed as follows: For the displacements (f) on the 
interface F we insert a boundary integral expression, namely 

(j> = -V{an) + -\- K)<l) 

which is obtained from the first integral equation (2.11). To obtain a symmet- 
ric coupling, also the second integral equation (2.14) is used on the interface. 
The coupled formulation is a typical saddle point problem: Find (cr, </», u, 7) € 
'H^iyXH^/‘^{F)xCxW such that for all (r, xH^/'^{F)xCxW 



a(cr,</);T,'0) -h6(r;tx,7) = 0 1 

b{a]v,T)) = -J^^f-vdO J 

with the bilinear forms 

a{a, (par.'ll)) := / r:C~^adf2 

J f2p< 

+ (rn , V (crn)) - {rn , {\l + K)<j>) 



and 



b{a;v,T]) ■.= / diy cr-vdO+ / cr \r)dQ. 

J Qp J Qp 



The bilinear form a is symmetric, i.e. 



(4.6) 



a(cr, <t>\ T, tp) = a(r, ip-, a, <p) . 
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Exploiting properties of the boundary integral operators and applying 
Brezzi’s theory of saddle point problems we obtain the following theorem. 

Theorem[3]: For every f e the saddle point problem (4.6) has 

a unique solution satisfying 

IklUaCfir) + I|div<r||i2(i2p) + + ll7llL2(fiF) 

<C\\f\\L,iO.) 

with a constant C independent of /. 

For the discretization of (4.6) there are several possibilities, including a 
family of mixed finite elements proposed and analyzed by Stenberg [24]. In 
[3] we prove optimal order convergence of the coupled method. 

In the FEM domain fip the tractions across element boundaries must 
be continuous. Our coupled discrete scheme also yields continuous tractions 
across the FEM-BEM interface. In contrast, continuity of displacements 
across the interelement sides and the interface is satisfied in a weak sense 
only. On the surface of the BEM domain, the displacements are continuous. 
Continuity of tractions is achieved by Lagrange multipliers. Then, in the FEM 
domain there are no continuity requirements except for the Lagrange multi- 
pliers. Hence all other unknowns can be eliminated on each element before 
assembling the global linear system, which, in its final form, can be written 
as 




Here, the degrees of freedom in the Lagrange multiplier are partitioned 
into Ap and the latter denoting the unknowns on the interface F. The 
discrete tractions and displacements in the BEM are denoted by T and 
respectively. The submatrices Ay, Ap and Aw arise from the boundary 
integral operators. The matrix C is a simple mass matrix. For more details 
on the implementation we refer to [4]. 

In the following example, we investigate the stress distribution around two 
neighboring cavities in an infinite elastic continuum under uniform vertical 
pressure of magnitude 1. The two cavities have circular cross-section and 
radius 1. The distance between the cavities is 1 (cf. Figure 4.5). For this 
plane strain problem the material parameters are Young’s modulus E — \ 
and Poisson’s ratio v = 0.2. Body forces are neglected. The discretization 
involves lowest order mixed finite elements and linear boundary elements (cf. 

[4]). 

Figure 4.6 shows the normal stress Gy across the line AB (cf. Figure 4.5). 
The mesh refinement is performed uniformly. The highest pressure occurs 
at the points A and B and is in good agreement with the analytic solution 
-3.264 [20]. 
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BEM mesh -b— 




Fig. 4.5. Two neighboring cavities in an infinite elastic continuum, mesh 
with 416 mixed finite elements and 40 boundary elements 



In [3] we investigated examples for which solutions are available in closed 
form and present experimental orders of convergence. 

A posteriori error estimators for mixed elements including Stenberg’s low- 
est order method [24] are studied in [2]. As our results in [4] confirm, the cou- 
pling with adaptively refined mixed finite elements is very efficient, especially 
for the treatment of corner singularities. 



5 Conclusions 

We presented parallel algorithms for boundary and finite element methods 
and their implementation on an MIMD parallel computer. The results show 
that a parallelization by data decomposition yields high parallel efficiency for 
BE and FE computations. 

For the coupling of the two methods, two different schemes are discussed. 
The multiplicative Schwarz method in two overlapping domains is very well 
suited for parallel implementation because the efficient codes from pure BE 
and FE computations can be used with low additional data exchange. For this 
method, there is still the need of theoretical investigations especially in the 
non-linear case. The convergence depends on the overlap of the two domains. 
A larger overlap leads to faster convergence but higher computational costs. 
If the overlap is small many Schwarz iteration steps have to be performed, 
but because optimal solvers for each subdomain can be used, the method 
remains very efficient. 

To solve linear systems, iterative methods often are superior to GAUSSian 
elimination, in particular on parallel computers [14]. When using iterative 
solvers Galerkin BEM becomes attractive since it yields symmetric matri- 
ces. The authors established a symmetric coupling of Galerkin BEM and 




336 



U. Brink, M. Kreienmeyer, E. Stein 




mixed finite elements of the Raviart-Thomas type. We presented theoreti- 
cal and computational results for problems of linear elasticity. A correspond- 
ing discretization for nonlinear problems is subject of further research. 



References 



1. Bader, G., Gehrke, E. (1991): On the performance of transputer networks for 
solving linear systems of equations. Parallel Computing 17, 1397-1407 

2. Braess, D., Klaas, O., Niekamp, R., Stein, E., Wobschal, F. (1995): Error 
indicators for mixed finite elements in 2-dimensional linear elasticity. Comput. 
Methods Appl, Mech. Engrg. 127, 345-356 

3. Brink, U., Carstensen, C., Stein, E. (1996): Symmetric coupling of bound- 
ary elements and Raviart-Thomas-type mixed finite elements in elastostatics. 
Numer. Math., in print 

4. Brink, U., Klaas, O., Niekamp, R., Stein, E. (1995): Coupling of adaptively 
refined dual mixed finite elements and boundary elements in linear elasticity. 
Advances in Engineering Software 24, 13-26 

5. Costabel, M. (1988): A symmetric method for the coupling of finite elements 
and boundary elements. In: Whiteman, J.R. (ed.) The Mathematics of Finite 
Elements and Applications IV, MAFELAP 1987. Academic Press, London 

6. Gruttmann, F., Stein, E. (1988): Tangentiale Steifigkeitsmatrizen bei der An- 
wendung von Projektionsverfahren in der Elastoplastizitatstheorie. Ing.-Arch. 
58, 15-24 

7. Haase, G., Langer, U., Meyer, A. (1990): A new approach to the Dirichlet do- 
main decomposition method. Proc. ’5th Multigrid Seminar’, R-MATH-09/90, 
Karl-Weierstrass-Institut fiir Mathematik, Berlin 





Parallel Coupling of BEM and FEM 337 



8. Haase, G., Langer, U., Meyer, A. (1992): Parallelisierung und Vorkondition- 
ierung des CG-Verfahrens durch Gebietszerlegung. In: Bader, G., Rannacher, 
R., Wittum, G. (eds.) Numerische Algorithmen auf Transputer-Systemen. 
Teubner Skripten zur Numerik. Teubner, Stuttgart 

9. Klaas, O., Kreienmeyer, M., Stein, E. (1993): Numerische Losung ebener Prob- 
leme der lineaxen Elastizitatstheorie mit der direkten Randelementmethode 
auf einem MIMD-Parallelrechner. In: Baumann, M., Grube, D. (eds.) Paral- 
lele Datenverarbeitung mit dem Transputer. Springer, Berlin Heidelberg New 
York, pp. 265-274 

10. Klaas, O., Kreienmeyer, M., Stein, E. (1994): Elasto-plastic finite element 
analysis on a MIMD parallel-computer. Eng. Analysis — Int. J. Comp. Aided 
Eng. Software 11, (4), 347-355 

11. Klaas, O., Niekamp, R., Stein, E. (1995): Parallel Adaptive Finite Element 
Computations with Hierarchical Preconditioning. Int. J. Comp. Mech. 16, 45- 
52 

12. Kreienmeyer, M. (1995): Kopplung von Rand- und Finite- Element-Methodsn 
far ebene Elastoplastizitat mit Implementierung auf Parallelrechnern. Doctoral 
Thesis, Universitat Hannover. Submitted 

13. Kreienmeyer, M., Stein, E. (1993): Parallel implementation of BEM, FEM and 
coupled FEM-BBM on a transputer network. IBNM-Bericht 93/9 

14. Kreienmeyer, M., Stein, E. (1995): Parallel implementation of the boundary 
element method for linear elastic problems on a MIMD parallel computer. Int. 
J. Comp. Mech. 15, (4), 342-349 

15. Law, K.H. (1986): A parallel finite element solution method. Computers & 
Structures 23, (6), 845-858 

16. Lubliner, J. (1990): Plasticity Theory. Macmillan Publishing Company, New 
York 

17. Maischak, M. (1992): Eine Randelement-Diskretisierung und projiziertes SOR- 
Verfahren fiir ein Kontakt problem der Elastostatik. Diplomarbeit, Institut fiir 
Angewandte Mathematik, Universitat Hannover 

18. Romate, J.E. (1993): On the use of conjugate gradient-type methods for bound- 
ary integral equations. Int. J. Comp. Mech. 12, 214-233 

19. Saad, Y., Schultz, M.H. (1986): GMRES: A generalized minimal residual 
method for solving nonsymmetric linear systems. SIAM J. Sci. Statist. Corn- 
put. 7, 856-869 

20. Sawin, G.N. (1956): Spannungserhohung am Rande von Lochern. VEB Verlag 
Technik, Berlin 

21. Schwarz, H.A. (1870): Vierteljahresschrift der Naturforschenden Gesellschaft 
in Zurich, Band 15 

22. Simo, J., Taylor, R.L. (1986): A return mapping algorithm for plane stress 
elastoplasticity. Int. J. Num. Meth. Eng. 22, 649-670 

23. Simon, H.D. (1991): Partitioning of unstructured problems for parallel pro- 
cessing. Comp. Systems Eng. 2, 135-148 

24. Stenberg, R. (1988): A family of mixed finite elements for the elasticity prob- 
lem. Numer. Math. 53, 513-538 

25. van der Vorst, H.A. (1992): BI-CGSTAB: A Fast and Smoothly Converging 
Variant of BI-CG for the solution of Nonsymmetric Linear Systems. SIAM 
J. Sci. Statist. Comput. 13, 631-640 

26. Yserentant, H. (1991): Hierarchical bases. In: O’Malley, R.E. (ed.) ICIAM 91. 
SIAM, Philadelphia. 

27. Yserentant, H. (1986): On the Multi-Level Splitting of Finite Element Spaces. 
Numer. Math. 49, 379-412 




Error Analysis for the BEM on Nonsmooth 
Curves 

T. Hartmann and E.P. Stephan 



1 Introduction 

In the following several methods for approximating the solution of integral 
equations on polygons are considered. These methods and the techniques 
of proving stability and error estimates have to take care of the fact that 
the solutions of the equations considered contain singularities. In Section 2 
we derive stability conditions for a quadrature method using splines for a 
weakly singular integral equation. The same equation is considered in the 
next two sections. In Section 3 collocation methods using Jacobi weights and 
polynomials were extended from an interval to a polygon and in Section 4, 
local error estimates for Galerkin’s method are given. In Section 5 a discrete 
collocation method using a nonlinear transform is considered for solving a 
hypersingular integral equation. 



2 Operator algebra methods 

In the following we derive stability conditions for the quadrature method of 
the single layer potential on a polygon F 

Vru{x) := -- [ log \x - y\u{y) dsy = f{x) x £ F (2.1) 
7T Jr 

with piecewise constant trial functions using a uniform mesh. If not explicitly 
mentioned, the proofs and further details are given in [10]. Let the Cauchy- 
singular integral operator Sp and the double layer potential Kp he defined 
as 

If 1 

Spu(t) — -uMdsr 

m Jp T -t 

Kru{t) f -^\og\T -t\u{T)dsr. 

'K J p uTlr 

Then there holds for t £ F: 

Spu{t) -f Kpu{t) = -iVpu\t). 

Due to this relation, we examine a quadrature method with piecewise linear 
trial functions for the equation Spu Kpu = /. This quadrature method 
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corresponds to a quadrature method with piecewise constant trial functions 
for Vru = f. The equation Sru + Kpu = f is replaced by a discrete operator 
equation RnUn = fn where Rn is an approximation operator of R := Sr + Kp 
acting in the space Xn of piecewise linear functions on a uniform mesh and 
fn ^ interpolates /. Such a method is called stable if Rn is invertible 
for n sufficient large and sup ||i?nll < oo. If the method is stable, f Riemann 
integrable, and Rn converges strongly to R, the approximate solutions Un 
converge to u. Thus, the crucial point is the proof of stability of the scheme. 
This is done by showing stability of a corresponding model problem on an 
angle, making use of a localization principle by Gohberg and Krupnik, e.g. 
[8]. Therefore we consider only the case of F being an infinite angle = 
M+ [j with opening 0 < a; < 27 t. The general case of a polygon follows 

by localization arguments. We fix n € N, 0 < e, (5 < 1 and for A: G Z choose 
the quadrature points and collocation points as follows. We introduce 
with Ak 1 if k > 0 and A^ := — if A: < 0: 



Fn = 

n 



k A 

n 



By using a rectangular rule as the quadrature formula we obtain for a dis- 
cretization of 



Sp^u + Kp^u = / 



( 2 . 2 ) 



the system 



- icot(7r(e - 8))u(tl) - -^Re | ^ = /(r^)- (2.3) 

If there exists a solution (u(t^)),A: € Z we consider a piecewise linear ap- 
proximation for the solution u € L'^iF^) of (2.2). This approximation is 
represented by the usual ’hat ’-functions 

Unit) = ^ mit)- 
kez 

We obtain for the values Un{t'^) 

= + keZ (2.4) 

and in order to determine the approximate solution Un we first have to solve 
(2.3) and to calculate the coefficients Q with (2.4). Now we have to show that 
for each / e L^^iF^) there exists a unique solution of this procedure. To prove 
this, some results on Toeplitz operators are needed which are due to Krupnik, 
see e.g. [9]. Let A denote the smallest algebra containing all Toeplitz operators 
T(a) with a e PC{Fl) (PC{F[) denotes the class of piecewise continuous 
function on the unit circle 77). Then Anxn C L{F)nxn is an algebra of 
continuous operators in /^xn- There exists a multiplicative linear mapping 
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^nxn ^ B Ab into the algebra of bounded nxn-matrix functions over 77 X 
[0, 1]. The symbol 4b of B = Bkj € A, is equal to 

Furthermore, B G 4nxn is a FVedholm operator if and only if det Ab{t, h) ^0 
for all r G 77 and 0 < /i < 1. 

Let Si , Ki denote the parts of the matrix 77i corresponding to the operators 
Sr^ 5 Kr^ . The symbol for S\ was calculated by Rathsfeld and Prossdorf [15] 
and for K\ by Costabel, Ervin and Stephan [2]. With the definitions 



Zq 






^ sin( 7 T(e - (5)) usx<i 



the symbol of R\ has the following form: 






P~^{t) 0 

0 

1 _ 2/1 icos(7T— g;)zo 

^ sin nzo 

—icos(n—u)zo ey ___ -j 

sin Tvzo ^ 



r 1 0 < /X < 1 

r = 1 0 < < 1 



Further development of the symbol yields the following theorem: 

Theorem 2.1. 7. The operator R\ G L{t^) is a Fredholm operator with 

index zero ^ e ^ 6 and S / 1/2. 

2. The operator R\ is invertible in L{r), if —I ^ where ^ is an at most 
countable set ^ CC\0 with accumulation points only in Q. 

3. If u = 7T, then ^ = 0. 



Now a quadrature method on a polygon can be introduced. Using a 
quadrature formula due to (2.3) it turns out that this method will be stable 
if the corresponding model problem for each angle of the polygon is stable, 
i.e. Theorem 2.1 is fullfilled. However to check assertion 2. is still an open 
problem. For further details see [10]. 



3 Collocation methods using Jacobi polynomials 

We consider the integral equation (2.1) in the case that F = [-1,1] is an 
interval 

1 

Vi'ipix) := — / log \x - yl'i/jiy) dy = f{x) x e F 
7T 7-1 

and rewrite its solution xj) — Q<j> using a function q{x) (1 - For 

xp we expect a singular behaviour near the endpoints of the form 
where r denotes the distance to the corner. Since the function g contains the 
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singularity of the solution we expect 0 to be more smooth than xj;. Now xjj is 
approximated by 

N-l 

XpN = Q4>N = ^ (^- 1 ) 

k=0 

Here, Tk denote the kth Chebychev polynomials of first kind and akN unkown 
real numbers. In order to obtain error estimates, we define non-standard 
Sobolev spaces. For / = we define 

oo 

ii/iiL == E + i/(o)i' 

k=l 

and the spaces to be the closure of the set of all polynomials with respect 
to the norm H • Using these spaces and norms stability and error esti- 
mates for the Galerkin method (3.2) and the collocation method (3.3) can be 
obtained. These spaces and techniques are introduced by Sloan and Stephan 
in [18] for a weakly singular integral equation. They are further applied for 
a hypersingular integral equation [7] and the airfoil eqution [12]. 



Find such that ( Vix/j^, ^)q = { E F^v-i (3.2) 

Find xj)^ such that Vixj;^{xjN) = j = 1, (3.3) 

Here xjn = cos denote the zeroes of the Nth Chebychev polynomial. 
The scalar product (•, *}^ is given by 

{ f ^9 )q-= j ^ f{3:)gix)Q{x) dx. 

Theorem 3.1. [12] [18] Let f G Then for N > Nq there exists a 

unique solution of (3.3) and (3.2), respectively. Moreover, if t > s there 
holds ( C is a constant independent of N): 

U-<l>%\\s,o<CN^-*\\f\\t+i,o s<t 

0<s<t. 

These assertions are confirmed by numerical results [18]. 

The integral equation 

Vr^(i) = -- f log |r - t|^(r)ds^ (3.4) 

7T Jr 

on a semi circle F = {t\t = < x < 1} with '4>(x) ;= 

t = can be written as 
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Vrm - Vii>{x) + K^Pix) = g{t) 

Kij}{x) ;= -- f log ^ ^ — —'ij){y)dy 

7T y_i x-y 

i.e. the integral equation on the semi circle can be transfomed in an integral 
equation on an interval [-1, 1] with an additional compact operator K. 

Theorem 3.2. Let f £ • Then for N > Nq there exists a unique solu- 

tion = Q(I>N of 

Vi'lpNiXjN) 4- K%l)N{XjN) - fi^jN) j = 1, 

Moreover, if t > s there holds ( C is a constant independent of N): 

U-<l>N\\s,e<CN^~'\\f\\t+i,e 0<s<t. 

Further we use a two level algorithm, which is proposed by Berthold, 
Hoppe and Silbermann for integral equations on intervals [1]. Therefore the 
coefficients akN of (j)N = 12k=o ^kNTk are calculated in the following manner: 

1. Choose M > 0 with M « N. 

2. Solve the collocation equations VixIjn{xin) = fi^m) for / = 

with ipN — Q^k=o ^kNTk- Due to the mapping property of the integral 
operator this yields a diagonal matrix. Take these coefficients bkN for 
k> M as coefficients a^N of the approximate solution 0iv. 

3. Solve the collocation equations for i = 1, . . . , M 

ViiIjm{xim) -f KtjjMixiM) = fi^m) - VixI>n{xim) 

with OkAfTk, where x[)n = QYk=M^kNTk denotes the 

part of the solution which has been calculated in step 2. This yields a 
system of linear equations of dimension M which are solved using Gau6’ 
procedure. Take the coefficients CkM with k < M as coefficients akN of 
the approximate solution 0]v. 

The two level algorithm yields the same convergence rate with less costs. If 
M can be choosen as « N the costs are of the order 0{N log N). 

Let T be a polygon with straight line segments Fj, F = Y^=i corners 
Tj and interior angles Uj. The solution of the integral equation Vr'ip = f 
defined by (2.1) has a singular behaviour near the corners tj as , where r 
denotes the distance to the corner and aj is defined by 



Qj := min 



£. ^ \_i. 

Uj ’ 27T - Uj J 



(3.5) 



The singular behaviour depends on the angle and can be of the form r'^ with 
-1/2 < 7 < 0. According to the collocation methods for weakly singular 




344 



Hartmann T., Stephan E.P. 



integral equations on an interval or on a smooth open curve we approximate 
the solution ^ of VrV’ = / at each straight line segment Fj hy a weight func- 
tion multiplied by a polynomial. Thus we can write for the approximation 

IpN- 

= 4nH (3-6) 

j=l j=l k=0 

where xpjsj = are real numbers and are certain polynomials. 

The collocation points on the segments Pi are denoted by {x\^) with I = 
1, N. Thus, the collocation method can be described as: 

Find ipN, i.e. real numbers a;^^, for and k=l,...,N, such that: 

J N-l 

^ ^ {y^kN^k) (^in) = fi^m) ^ •••? J ^ = Ij (3.7) 

A ;=0 

Different choices of the collocation points xin^ the weight functions and the 
polynomials are considered in [10, 13]. Further fast algorithms due to [1] 
are tested in [10, 13], for brevity we refer to these papers. 



4 Local error estimates for Galerkin’s method 

In the following we consider Galerkin’s method for solving Symm’s integral 
equation (2.1) on a polygon. It is well known that the behaviour of the approx- 
imation depends heavily on the characteristics of the settings of the problem. 
Non-smoothness of domains and singularities of given data produce singu- 
larities of the exact solution, and reduce the global rate of convergence of 
the approximations. It is then natural to ask whether the accuracy of the 
approximation is better in regions where the exact solution is smooth. 

For strongly elliptic integral equations on a smooth and closed curve in 
, local error estimates for the Galerkin method were proved by Saranen 
[16]. With adaption of the techniques, the estimates were proved for the case 
of open surfaces in E^ by Stephan and Tran [19]. The proofs of the following 
theorems concerning the case of a polygon are based on these works and can 
be found in [14]. 

Let be any fixed straight line segment of the polygon P with nested 
subsegments Pq and P* 

Po CC P* CC and let d := dist{dP^,dro) 

where X CCY means that the closure of X is contained in the interior of Y. 

Since local error estimates are only valuable when the exact solution of 
the equation is in some subsegments smoother than on the whole polygon P, 
it is worth considering local regularity of the solution. This is shown in the 
following theorem: 
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Theorem 4.1. Let f E H^{F^) with s > 1/2. Then there holds for the 
solution xj) of Vrx/j = f 

^ C'{ll/IU*(r.) + IIV’llH-i/2(r)}- 

We will consider the following spline spaces 

= {<!>£ SI: supp<^ C Fj} 

Shir.) = {v e ■. v\r, = <t>\r. for some <f> 6 5;;} 

and we are concerned with the error \\xp - xjJh\\t,ro, where xp and xph satisfy 

V- e H-^/^ir) n H%r.) for - 1/2 < s < r 

i’h e Slir.) (4.1) 

{v{^-^h),<l>)^0 for all ^ e (r.). 

Note that we only assume in (4.1) that xph is a spline on the subsegment T* 
which can be arbitrarily extended onto F provided that the extension belongs 
to 



Theorem 4.2. Let xp £ H^(F^) and /3 < -1/2 arbitrary but fixed. Then for 
—r — l<t<s<r and t < r — 1/2 we have 

U - i’hhn < C{h*-*llV-IU,r. + - M 0 ,r} (4.2) 

where a — Q if t < —1/2 and a — —1/2 — t if —1/2 <t. 

Example: If piecewise constant trial functions (r=l) are used we expect 
the global error in the L^ norm to be of order where sq depends 

on the geometry and takes values between 1/2 and 1. In the //-V2-«o norm 
the highest order is achieved which is 0{h^^^). Due to Theorem 4.2 we expect 
a local error in the norm of order 0{h^) for the first term in (4.2) and of 
order for the second term in (4.2). 

A better approximation can be obtained by averaging the values of the 
Galerkin solution, using the K-operator. The action of the K-operator on the 
approximate solution xph is denoted by Kh{xpfi) and is given by convolution 
with a carefully chosen spline This spline is defined by a combination of 
2gr — 1 splines each being a (/ — 1) times convolution of a B-spline. For more 
details see [14, 21]. 

Theorem 4.3. Let / > r + 1 and q > r -{-1/2. For the solution xp ofVxp — f 
there holds xp £ H‘^^^^{F^). Let 0 be arbitrary but fixed. Then there holds 
with C independent of h: 

\\Khi^Ph) - V’llo.ro < C{/i2'-+i||^||2.+i,r. + Uh - V-ll^+r+i.r}- (4.3) 
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Since in (4.1) it is a-ssumed that xjjh is a spline only on a subsegment 
we can use a grid which is uniform on F^ and graded near to the corners. 
This yields an approximation for which the second term in (4.3) is of higher 
order than the first term. In this case we found that the error for piecewise 
constant trial functions is of order 3. 

Further the local error estimates are used to show that the local residual is 
a local error indicator and can be used for constructing adaptive algorithms. 
In the case of smooth closed curves this is shown by Saranen and Wendland 
[19]. With an approach diflFerent from that used in [19] similar results can be 
obtained for polygons. The proof adapts techniques used for finite element 
methods by Johnson and Eriksson, see for example [6], and can be found in 
[10]. 

The strategy of the method considered is to adapt the mesh size to the size 
of the residual so as to equidistribute the element contributions to the global 
error. For a choosen error tolerance TOL the algorithm looks as follows: 

1. Start with a coarse and quasiuniform mesh. 

2. Compute the corresponding Galerkin solution xIjh- 

3. Calculate the residual R :— f - VrV^/i. 

4. If there holds for any element I (N number of elements) 



\\R\\,j<TOLlN (4.4) 



then accept 'ijjh and stop. Otherwise, halve all elements I for which (4.4) does 
not hold and go back to step 2.. 

Let F[ and h denote the global and local meshsize, i.e.. 



H := max is meshsize of an element on F 
h := max is meshsize of an element on T* 



} 

} 



Of course, there holds h < H. 

For this algorithm local efficiency can be proved. 



Theorem 4.4. Assume that 0 < s < 1 and for a positive constant C\ 



llfllli.ro > Cl TOL 

then there exists for sufficient small h a subsegment F^ with 

h^m\s,r. > C TOL. 

Note: For brevity the local error estimates and the K-operator are here 
formulated for Symm’s integral equation only. However, similar results can be 
obtained for more general operators which are pseudodifferential operators 
restricted on polygons, see [14]. 
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5 Discrete collocation using nonlinear transform 



In the following we consider the hypersingular integral equation Wu = f 

^ i /r ^ I'' “ 

on a teardrop shaped domain F with single corner at xq. Following the 
development in [3, 4, 5], we now rewrite (5.1) using an appropriate non- 
linear parametrization 7 : [0, 1] ^ T which varies more slowly than arc- 
length parametrization in the vicinity of xq. Consider a parametrization 
7 o : [0, 1] ^ r such that 7 o( 0 ) = 70(1) = xq and l7o(5)| > 0 for all 0 < s < 1. 
Choosing a grading exponent g € N and selecting a function u such that 

= 0<s<l, (5.2) 



we define the mesh grading transformation 

7(5) = 7o(o;(s)), where u{s) = 



^^{s) -I- 1/^(1 — s) ’ 



The chosen parametrization 7 is graded with exponent q near the corner. The 
simplest choice of u satisfying (5.2) is, of course, i/(s) = s, for further details 
see [4]. 

Using this transform we rewrite (5.1) with {x = 7(5), ^ = 7(<^))j multiply 
with |7'(5)| and achieve for 5 € [0, 1] 

Kw{s) := log lx - 41 IV Wll7'(s)l»«(o-) dn = g(s), (5.3) 

7T uTlx Jq uTl^ 



where 

w{(t) := u{j{(t)), g{s) := l7'(s)l/(7(s))- 

If the right hand side / is smooth, the solution w of (5.3) becomes arbitrarily 
smooth by choosing a grading exponent large enough. Therefore trigono- 
metric polynomials are a good choice to approximate w. Since there holds 
Wu 0 for any constant u, an additional constraint is neccessary to ensure 
a unique solution. We look for solutions w of Kw = / satisfying w = 0. In 
the following we define some operators and spaces which will be used below. 

H^(n :={/€i/*(r) l^/ = 0} 



Aw(s) = -2 f log|2e sin(7r(s — a))\w{a) da, 

Jo 

Bw{s) = Kw{s) - A“^tt;(s) = / b{s,a)w{a) da, (5.4) 

Jo 

Further let Dv{s) = v'{s) denote the derivate of v. The following mapping 
properties can be obtained: 
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Theorem 5.1. The operators A~^K : and K : are 

continuously invertible. Further there holds a Gdrding^s inequality: 

Re{{A~^K + T)v,v) > c||Dt;||o v £ 

with a compact operator T. 

In order to solve (5.3) we choose as collocation points 



Sj = j/i, j = 1, 2n, where h := l/(2n -h 1), 

and denote by the space of trigonometic polynomials (j) of degree < n 
satisfying J </> = 0. We look for an approximation Wh defined by 

Kwh(sj) = g{sj) j = 1, 2n. (5.5) 

For any continuous 1-periodic function v with f v = 0 we define an interpo- 
lation operator Qh onto 7^ by 

(Qhv)(sj) = v(sj), j = 1, 2n. 

Since Qh and A~^ commutate on 7^ we write (5.5) as 

{A~^ + QhB)wh = Qhg, Wh£Jh- (5.6) 

Further we can show for Wh €.T^ 

+ QhB)wh = -D'^Wh + RhA~^Bwh 

using Rh ;= A~^QhA. We derive for (5.6) an equivalent version: 

(-D^ + RhA~'^B)wh = Rhe Wh € 7^, (5.7) 

where e = A~'^g. We can only prove stability for a slightly modified equation. 
Therefore a cutoff operator Tr is introduced: 



Tr{v){s) 



u(s), 

0 , 



s G (r,l -r), 
sG(0,r)U(l-T,l) 



Using the cutoff for (5.7) we consider 

{-D^ + RhA~^BTi.h)wh = Rhe wh 6 Tt- (5.8) 

with fixed i* independent on h. If = 0 then (5.8) and (5.7) are equiv- 
alent. For a discrete collocation method we use the trapezoidal rule and a 
regularization to replace B in (5.6) by 

2n 

Bhv{s) := b{s,aj){v{aj) -v(0)). 

3=0 



and prove stability and error estimates. 
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Theorem 5.2. Let q>2 and i* large enough. Then there holds: 

1. The discrete collocation equations are stable^ i.e 

Wi-D^ + RHA-^B^Ti.h)v\\-i > c\\Dv\\o v € 

for h small enough, with c independent of 1^ and v. 

2. If f is smooth enough and there holds w £ we have the error estimate 

||w - w/i||i < ch^~^ 

with c being a constant depending on i*, but independent of h. 

A similar theorem can be proved concerning the collocation method (5.8) 
without replacing B by 3^. For further details see [11]. 
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The hp- Version of the BEM with Geometric 
Meshes in 3D 

M. Maischak and E. P. Stephan 



1 Introduction 

The solutions of three-dimensional elliptic boundary value problems in poly- 
hedral domains or on screens have special singular forms at corners and edges. 
If those problems are converted via the direct method into boundary integral 
equations, the solutions of the latter inherit those edge and corner singu- 
larities. The singularities disturb the convergence of numerical schemes, e.g. 
finite element or boundary element methods. Numerical schemes converge 
as fast as the solution is approximated by the chosen trial functions due to 
the quasioptimality of the Galerkin method. Thus the rate of convergence 
is determined by the regularity of the solution. Even for smooth data, the 
regularity is reduced by edge and corner singularities, and therefore the con- 
vergence of these procedures is much slower than in case of smooth domains. 

In 2D boundary value problems we have only corner singularities (see 
[2]). From [24, 25, 26] which are based on [5] it is known that the solu- 
tions of three-dimensional elliptic boundary value problems also contain edge 
and corner-edge singularities. While edge singularities can be analyzed using 
one dimensional approximation results (see [8]), the corner-edge singularities 
demand further investigation. Therefore we have defined countably normed 
spaces which contain not only the well-known corner singularities but also 
the edge-singularities and even more the corner-edge singularities. As we do 
not only have point singularities, all approximation results of [7, 8] have to 
be extended and generalized considerably. First results for the hp-version of 
the finite element method for 3D problems have been announced in [3]. Be- 
cause in this case the trace theorem gives no result for screen problems, those 
results are not suitable for the hp-version of the boundary element method 
considered here. Also the mesh defined in [3] is more complicated than the 
simple tensor product mesh defined here (see Figure 3.1) and demands much 
more eflFort to implement. From Lemma 3.1 it can be seen that the singular- 
ities described in [24, 25, 26] are contained in our countably normed spaces. 
From [24, 25, 26] it is known that for opening edge angles a; < 27 t of the poly- 
hedral, i.e. we have no screen problem, the singularities are square integrable, 
which can be analyzed directly using our countably normed spaces. The case 
of screen problems for the Laplacian is analyzed in [9] and for the Helmholtz 
operator in [10]. The case of the Lame operator is analyzed in [17]. 
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In this paper, it will be shown that under proper assumptions usually sat- 
isfied in applications, the hp- version with a geometric mesh for the Dirichlet 
problem of the Laplacian has exponential rate of convergence with respect 
to the number of degrees of freedom. In Section 2 we consider the Dirich- 
let screen problem (for Dirichlet and Neumann problems in polyhedrons see 
[11]). The countably normed spaces introduced in Section 3 are motivated by 
the form of the corner-edge singularities described in [25]. 

In Section 3 we define the mesh and the appropriate countably normed 
spaces. For the proof of our main result (Theorem 3.3 and Theorem 3.4), the 
exponential convergence of the hp- version of the BEM for 3D problems, we 
define an equivalent countably normed space on a reference element using 
cartesian coordinates. In Section 4 we describe some numerical results. 

In the following, C denotes a generic constant. 



2 The screen problem and the Galerkin procedure 

For brevity let us consider the Dirichlet screen problem with F = (0, 2)^: 



Given / on the screen F find u in fir 


= E^\r satisfying 




o 

II 


in 17 j". 


(2.1) 


u^f 


on F 


(2.2) 


and 


as |a:| oo. 


(2.3) 



In [1, 4, 21] this problem is converted into the integral equation 



for the Dirichlet problem. Here V is the single layer potential. 

It is known from [4], [21] that the unique solution of (2.4) belongs to 
a certain Sobolev space of distributions concentrated on F. Namely, given 
/ G H^/‘^{F) there exists exactly one solution ^ e H~^^‘^{F) of (2.4). 
Furthermore, problem (2.1), (2.2), (2.3) and equation (2.4) are equiva- 
lent, i.e. u G solves (2.1), (2.2), (2.3) if and only if the jump 

[^]\r € #-^/2(r) is the solution ^ of (2.4). 

The regularity theory in [25, 26] shows that even for smooth right hand 
side / the solution ^ of the integral equation (2.4) becomes unbounded near 
the edges of T, it has so-called edge and corner-edge singularities. From [25, 
Corollary 5] and [26, Theorem 2.7 and (2.38)] we present the regularity result 
for the screen F = (0,2)‘^: 
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Theorem 2.1. Let F = (0,2)^ and f € H^{F) in (2.4) be given. 

The unique solution S' £ H~^^^{F) of (2.4) has near the origin a decomposi- 
tion into edge and comer-edge singularities of the form 

S' = «Po + 

+ sin“^/^ ^ ^ ~ cos~^/^ 6 (2.5) 

with S'o € H^~^{F), a\ 6 K, u){9) 6 !% ei{r) = bir"^ + Ci, 

r-i/2ci(r) G r-^/^a{r) G L‘^(R+), 6^ G M, t = 1,2. 

Corresponding decompositions hold at the other vertices of F. 

Here (r, 6) are plane polar coordinates concentrated at the origin, x x 
are cut-off-functions with x = 1 for r < 1/4, % = 1 for 0 < 7t/4. The 
exponent 7 0.2966 is obtained in [20]. 

Remark 2.1. The analysis in [25] and [26] shows that for smoother data / 
in (2.4) expansions like (2.5) hold with smoother remainder terms ^0 and 
smoother contributions to the edge and corner-edge singularities. 

It is shown in [4] that the operator V in (2.4) is positive definite, i.e. there 
exists a constant rj > 0 such that for all (p € H~^^‘^{F) 

{V‘p,ip)L^r) ^vWv’fH-u^ry 

It is well known that this property of V guarantees the convergence of the 
boundary element Galerkin solution for (2.4) if conforming, approximating 
subspaces Xn of H~^/‘^{F) are used. 

Then the Galerkin scheme for (2.4) reads: 

Find € Xiv such that for any (p £ Xn 

{V^N,^)L^(r) = if,p)L^{r)- (2.7) 

Application of the general results of the Galerkin procedure for strongly el- 
liptic pseudodifferential operators (see [23]) yields the following result. 

Proposition 2.1. The Galerkin equation (2.7) is uniquely solvable for any 
N. Let^ be the solution of (2.4) and^N its Galerkin approximation defined 
by (2.7). Then there holds 

\\^ — ^ |||^ “ ^llH-i/2(r) j ^ ^ • (^-8) 

Here C is a positive constant independent of N. 
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3 The hp- version with geometric mesh 

Let r be the boundary of a simply connected bounded polyhedron i? in . 
We first introduce appropriate countably normed spaces (0 < ^9 < 1) 

(cf. [6]). Let F be a face of fi having the corner points ei , . . . , e^- For each 
z, fi denotes the edge of F connecting with where the periodicity 
convention Cm+i = ei, /m+i = fi is adopted. Consider a covering of F by 
neighbourhoods C/j (i = 1, . . . , m) of e* not containing ej, j ^ i, and introduce 
polar coordinates {n.&i) of origin ei in Ui such that ri - dist(x,ei) and 
the edges fi-i and fi are given by di = Ui and &i = 0, respectively. For 
0 < ^ < 1, let 

Bp{F) = {u\u e (F) : 3C, d > 0 independent of k such that 

+ (3.1) 

ar ao — k = IJ 1, . . . = 1, . . . ,m}, 

Bp{F) = {u |u E H^~^{r),u E Bp{F) for all faces F of F} 

with (a)+ := max(a, 0). If we would like to emphasize the dependence on the 
constants C, d we will write Bp{F) = Bp Q^^(F), etc. 

Now we define the geometric mesh on the faces of the polyhedron. We as- 
sume that the face F is a triangle. This is no loss of generality because every 
polygonal domain can be decomposed into triangles. We divide this triangle 
into three parallelograms and three triangles where each parallelogram lies 
in a corner of the face F and each triangle lies at an edge of F apart of the 
corners. By linear transformations (fi we can map the parallelograms onto the 
reference square Q = [0, 1]^ such that the vertices of the face F are mapped 
to (0, 0). The triangles can be mapped by linear transformations (fi onto the 
reference triangle Q = {{x,y) ^ Q \ y < x} such that the corner point of the 
triangle in the interior of the face F is mapped to (1, 1) of the reference trian- 
gle. By Definition 3.1 the geometric mesh and appropriate spline spaces are 
defined on the reference element Q. Analogously the geometric mesh can be 
defined on the reference triangle Q (see Figure 3.1). Via the transformations 
the geometric mesh F^ is also defined on the faces of the poly- 
hedron. The approximation on the reference square is the more interesting 
case because it handles the corner-edge singularities. Therefore we deal in the 
following only with the approximation on the reference square. 

Definition 3.1 (geometric mesh). Let Q = [0, 1] x [0, 1]. For 0 < a < 1 
we use the partition of Q into subsquares Rki 

n 

Rki = [xk-i,xk]y<[xi-i,xi], (k,l=l,...,n), Q- [J Rki (3.2) 

k,l=l 



where 
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xo = 0, — k — (3.3) 

With we associate a degree vector p = (pi, . . . o,nd define as 

the vectorspace of all piecewise polynomials v(x, y) on Q having degree pk in x 
and Pi in y on [xk-i,Xk\ x [xi-\,xi\, k,l = l,...,n, i.e. •y|[a;fc_i, 2 ;fc]x[x, _!,*,] € 
Ppk ^p^(Rki). For the differences hk = Xk — Xk-i we have with A = (1 — cr)/a 

hk = Xk-Xk-i = Xk-i{--l) < x{--l) = xX, Vx € [xk-i,Xk] (2 < A: < n) 

(7 G 

(3.4) 

The index r € (0, 1} in determines the regularity of the piecewise 

polynomials, i.e. discontinuity in case of r = 0 and continuity in case of 
r = 1. 




Fig. 3.1. Geometric mesh with cr = 0.5 on a polyhedral face 



Now we define countably normed spaces on the reference element Q using 
cartesian coordinates 

Definition 3.2 (countably normed spaces Bj^{Q)). 

Let /3 he a real number with 0 < ^ < 1. The weight function = 

^p,a,l{x,y) is for a = (cxi,a 2 ) and an integer I > 1 defined by 



miii(ai — l,ai+a2 — 0 min(a2”l,Q:i+a2 — 0 

7i=max(ai— /, 0 ) 72=max(a2— ^, 0 ) 



^Q!i+q;2— 72 y'Y2 



(3.5) 



Let 



ai“i 



D“ = = 



The countably normed spaces for I > 1 are defined by 
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B0(Q) = ^u:u£H^ ^(<3),ll^/3,a,j£>“u||i2(Q) < C7d* 
for \a\ = k = l,l + I,. . . , 

with C > l,d > 1 independent of k^. (3.6) 

If we would like to emphasize the dependence on the constants (7, d we will 
write B^(Q) = etc. 

Theorem 3.1. [14, 15] LetQ he the reference element and let ip be the linear 
transformation from a parallelogram, which lies in a corner of the face F, to 
the reference element Q. Then it follows for I = 1,2 from 

^ ^ (3-7) 

that we have 

U0(f-^ £ Bj,c,diQ) (3.8) 

where C,d (resp. C,d) are the constants in the definition of Bp{Q) (resp. 

&MQ)))- 

Remark 3.1. In the case i = 1 we have also equivalence. 

Now we show that singularities (which typically appear in solutions of bound- 
ary integral equations on polyhedral domains Q [24, 25, 26]) are contained in 
the countably normed spaces considered here. The arising singularities have 
the form (3.11) near the corners of the faces of Q. Therefore we need count- 
ably normed spaces which describe singularities in the angular variable 0. 

Definition 3.3 (countably normed spaces .B^([0, cj])). 

The countably normed spaces B^([0,a;]) for I > I are defined by 

B^0,cA^M) = {« : « G H*-i([O,a;]),||(0(a; -0))^+“-‘9gu|U2([o,„]) 

< Cd^~\a — l)\ for a = l,l 1, . . . , 

with C > l,d > I independent of a|. (3.9) 

Lemma 3.1 (corner-edge singularities). [18] Let 0 < /? < 1. Let (ak)k 
be a sequence of real numbers. Let {\k)k be a sequence of real numbers with 
Ajk+i - Xk > 0 for all k > 1 and X\ > I - P - 1. Let Wk £ ([0,^]) 

with dk <d <oo for all k>l and \\wk\\w-i([o,u]) ^ ^k- 

limsup 2 |A*+i|-|Afc|^Ak+i-A* < q, < 1 (310) 

k-^oo (^k\o>k\ 

Then we have 

oo 

Y^aA“wkie) e S^,c,rf(5^) 

k=l 

with the sector = {(r, 0) 1 0 < r < /?, 0 < 0 < a;}. 



(3.11) 
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Remark 3.2. ^he restriction of (see (3.1)) to the sector 

on the face F. 

Corollary 3.1. [18] Let 0 < < 1 and v > I - 1/2 -(3, \ >l — I — /3, 

A > —1, 1 / > — 1 / 2 . Then we have for I > 1 

r^0‘' e (3.12) 

The following result is used to estimate the approximation error for smooth 
remainder terms in the asymptotic expansions of the solutions of the integral 
equation (2.4). 

Theorem 3.2. [9^ Lemma 5] Given u G H^{Q) there exists (f) £ 
such that 

h-4>\\LHQ)<cM^-^y* (3.13) 

with C independent of N = dim5^’^((5^) 

The exponentially good approximation properties of splines on our geometric 
meshes for general functions u € Bp{Q) (? = 1 , 2 ) are given by the following 
theorem. 

Theorem 3.3. [18, 19, 14] 

(i) Let u{x,y) E with 0 < < 1. Then there is a spline function 

UN{x,y) £ and constants Ci,b\ > 0 independent of N, but 

depending on (J,ii,/3 such that 

\\u{x,y) - UN{x,y)\\L^(^Q) < Ci e"*** ^ (3.14) 

with p = (pi, . . . ,Pn), Pk = [tt{k - 1)] for p> 0 and N = dim5^’°((3”). 
Moreover un{x, y) can be chosen to be the -projection ofu on 5^’^(Q^). 
(a) Let v{x,y) £ B^{Q) n C^{Q) with 0 < P < 1. Then there is a spline 
function vn € and constants (72,6*2 > 0 independent of N, but 

dependent on a, p, P such that 

\\v(x,y) - VNix,y)\\Hi(Q) < 6*2 (3.15) 

with p = (pi , . . . ,pn), Pi = 1, Pk = max(2, [p{k - 1)] + 1) (k > 1) for 
fi> 0 and N = dim (Q^) . 

Next we give the energy estimate for the Galerkin solution of the integral 
equation (2.4) computed via the hp-version. In the following let Xn = 
5^’0(rj^) denote the subspace of piecewise polynomials on the geometric mesh 
and let Xn he locally defined ais in Definition 3.1. 
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Theorem 3.4. [9, Theorem 2] Let the right hand side f in equation (2.4) 
he piecewise analytic and ^ be defined by (2.4) and € Xn = its 

Galerkin approximation be defined by (2.7). Then, with N = diuiX^, there 
holds for any a > 0 

11^^ - ^n\\h-U2(d < + 0{N-^) (3.16) 

with constants C > 0 and b > 0, independent of N but depending on a, fi 
and OL. 

Proof Due to Theorem 2.1 we need only to consider the behavior near the 
corners and edges ot P = (0,2)^. With respect of the symmetry of P it is 
sufficient to look at the origin. Hence let us consider the most singular part 
of the exact solution near the origin, namely the corner-edge singularity in 
( 0 , 0 ). Since uj{6) ^ sin^/^ 9 we have to approximate the function 

•= sin^/^ 6 + r'^~^ sin“^/^ 6 (7 « 0.2966) (3-17) 

by a spline-function = x/ji xp 2 e The term r"^“^sin^/^0 

t- 7 - 1 ^ 1/2 be approximated exponentially fast by splines in 

the L'^-norm due to Corollary 3.1 and Theorem 3.3. In [9] it was shown that 
the term r^~^ sin“^'^^ 0 can be approximated exponentially feist by splines 
' 4)2 ^ in the H~^/‘^{Q)-noTm 

\\r'^-^sm-^/H-Mx,y)\\H-u^Q)<Ce-^^. (3.18) 

Now the remaining part of namely ^ is smoother than the most 
singular terms of Since / is piecewise analytic, the difference ^ can 
be decomposed into a sum of higher edge and higher corner-edge singulari- 
ties and an arbitrarily smooth remainder term. This can be seen by further 
expanding the functions Vk and in the decomposition (5.4) in Theo- 
rem 8 in [25]. The resulting series expansions for these functions Vk and ^ ^ 
themselves can be splitted into a finite sum of terms belonging to countably 
normed spaces plus a remainder term in H^{Q), for t sufficiently large. These 
terms in the finite sum are of the form 

r^e'' e l = loT 2. 

It is shown in Lemma 3.1 that even infinite series of those terms belong to 
countably normed spaces provided A, u grow sufficiently fast. Now, those 
additional singularities can again be approximated exponentially fast due to 
Theorem 3.3. Thus in view of such an improved decomposition it is left to 
approximate globally a remainder term in H^{Q) where t > 0 is arbitrarily 
large. With Theorem 3.2 we obtain that this remainder term can be approx- 
imated with 0{N~^) with a = {t — l)/4. 

Remark 3.3. For the Neumann problem we get the analogous result using the 
spaces and instead of and respectively (see [9, 14, 19]). 
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Remark 3.4- The analogous results hold also for Dirichlet and Neumann 
problems of the Lame- and Helmholtz-equations (see [10, 17]). 



4 Numerical Results 

Here we present numerical results (performed on the CDC 990 and Siemens 
S400/40 at the RRZN, University of Hannover) for the Dirichlet problem of 
the Laplacian on a screen. We consider the square F with sidelength 2. We 
choose the function 1 as the right hand side, i.e. / = 1. For the Dirichlet 
problem we look at the capacitance C of the screen T, which is given by 

(«) 

where the charge density ^ solves the integral equation 

( 4 - 2 ) 

In [1] a numerical value of 0.733576 in case of the square plate is given for 
the capacitance C. One obtains an approximation Cn to the capacitance C 
by inserting the boundary element Galerkin solution for (4.2) into (4.1). 
Because of the special right hand side f — \ there hold the inequalities 

ci||«P - (4-3) 

for constants c \ , c *2 > 0 and therefore the error estimates 

|C-(7w|<C3| (4.4) 

for the pure h- and p-versions (see Lemma 1.6 in [1] for the h-version and 
[22] for the p-version). 

Due to Theorem 3.3 and Theorem 3.4 we expect for the hp-version with 
geometric mesh numerically an exponentially fast convergence. In Figure 4.1 
we compare the errors for the different versions on the square plate: The 
error curves are linear for the pure h- and p-versions and curved downward 
for the hp-version with geometric mesh. The convergence of the latter is 
indeed exponential as Figure 4.2 indicates where we obtain nearly straight 
lines by plotting the logarithmic errors in capacitance against the fourth root 
of the number of unknowns. 

Remark 4-F Corresponding computations have been done for the Neumann 
problem of the Laplacian on a screen [9], for Dirichlet and Neumann problems 
on polyhedrons (cube, L-block) [16], for the Lame equation [17] and the 
Helmholtz equation [10] with analogous results. 
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Fig. 4.1. Dirichlet problem on the screen F in 



Error in Capacitance 




Fig. 4.2. Dirichlet problem on the screen F in R^ 






The hp- Version of the BEM with Geometric Meshes in 3D 361 



Remark 4-8. All numerical investigations have been done using three pro- 
grams, namely maibemS.f, maibem2.f, maifem2.f . The programs are con- 
figurable using a common control language, i.e. all implemented operators, 
solvers, geometries, etc. are usable in every meaningful combination. 

maibem2.f realizes the 2D-BEM on polygonal boundaries, maifem2.f re- 
alizes the 2D-FEM on domains with polygonal boundaries and maibemS.f 
realizes the 3D-BEM on polyhedral boundaries. 

We have developed three libraries of subroutines (liblapS.f, liblameS.f, lib- 
helmS.f), which include the analytical computation of the Galerkin elements 
and potential functions on rectangular and parallelogram elements for an 
arbitrary hp-distribution (see [12]). Analogously we have developed similar 
libraries for the 2D-BEM on arbitrary polygonal boundaries (see [13]). 
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A Transonic Panel Method for Helicopter 
Flows 

S. Wagner and A. Rottgermann 



1 Introduction 

1.1 General Remarks 

Since the early 1960’s, boundary element methods^ have been an indispens- 
able tool for analyzing the flow fleld and shaping the surfaces surrounded by 
flows. After Hess & Smith [17] introduced the panel methods into fluid me- 
chanics, many different methods have been developed [31, 32, 41]; surveys of 
the existing methods are given by Wagner [45, 46] Hoeijmakers [18] and Hess 
[16]. With the increasing use of computers, lifting surface theories [43, 44], of 
which some had been very sophisticated until that time, increasingly lost im- 
portance. The scope of application for boundary element methods, however, 
was extended to include supersonic and unsteady flows. 

Reasons for the success of the boundary element methods are: 

o the capability of calculating accurate solutions of the linearized potential 
flow of very complex conflgurations including the vortex layer 
o an acceptable expenditure for the panel discretisation achieved by reducing 
the problem by one dimension to the body surface 

The competing finite-difference, finite- volume and finite-element meth- 
ods need space networks around the configuration that usually contain many 
grid cells. Thus, the grid generation and the solution of the flow problem in 
this grid require a lot of computation time. By novel solution procedures for 
convergence acceleration and new grid generation strategies, these methods 
can also be employed on more complex geometries. 

To remain competitive, panel methods have to be further developed as 
well. The boundary element methods will only have future prospects if two 
of its greatest restrictions can be overcome: 

1) The computation expenditures increase quadratically or even cubically ac- 
cording to the number of the used surface elements 

2) Panel methods are based on linear integral equations and thus can calculate 
only incompressible and weak compressible flows 



In aerodynamics they are usually called panel methods. 
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The computation expenditure can be reduced by several methods: multigrid- 
methods, clustering by coarse grid estimations [39], panel-clustering tech- 
niques [11]. 

It is getting more and more important to include the interference effects 
between body and wake vortex layers in calculation, e.g. to study the non- 
linear lift behavior of delta wings with leading edge vortices [1, 19, 24, 42] or 
the influence of the blade tip vortex on the circulation distribution of a rotor 
[8, 22, 35]. In Cctse of rotors, blade-vortex interferences^ and transonic effects 
are very important in the area of the blade tip. The separating vortex layers 
can be especially well described by using vortex-lattice methods. In order 
to capture the compressible and transonic effects, the second restriction has 
to be removed and the flow fleld has to be described by a nonlinear integral 
equation. 

1.2 Objectives 

Main emphasis is placed on developing a method that allows to determine 
the influence of the blade vortex interference and of the compressibility of 
the fluid on the blades of a rotor. 

To capture the development of the free vortex layers and its interferences 
with bodies, a vortex layer procedure [27, 33, 35] based on a panel method 
for delta wings by Behr and Urban was developed at the University of the 
Federal Armed Forces in Munich. For the present work, the method, originally 
developed only for thin foils and external Neumann boundary condition, was 
extended to thick blades and extended by the feasibility of using an inner 
Dirichlet boundary condition. Since the compressible effects of the fluids have 
to be captured as well, it was necessary to surmount the limitation to the 
linear integral equation and to employ a nonlinear integral equation. The 
objective of this work is to prove that transonic effects at the blade tip of 
helicopter rotors can be described by using an extended integral equation 
procedure. 

In industry, panel methods are very successfully used for the flow com- 
putation of complex conflgurations. However, ats a rule, these methods are 
limited to slightly compressible subsonic flows and pure supersonic flows. 
When developing a method that includes compressible flows up to the tran- 
sonic speed domain, the main attention is focused on the easy operation and 
the combination with already existing panel methods. 

The second restriction of the conventional panel methods is the consid- 
erable increase of computation time for panel numbers N > 1000 — 2000. 
To facilitate a more efficient computation of the linear part of integral equa- 
tions, the panel- clustering method^ introduced by Hackbusch & Nowak [11] 
and further developed by Sauter [37] is described in order to reduce the ex- 
penditures on computation and storage. This method considerably increases 

^ BVI — Blade- Vortex-Interference. 
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the efficiency of boundary element methods but up to now has not been doc- 
umented in a form that would be easily comprehensible for most engineers. 

The theoretical principles of the field panel method and further results 
are presented by Rottgermann [34]. 



2 Basic Equations 

2.1 Equations of Motion 



For irrotational flows, a velocity potential can be defined, from which the 
velocity components can be determined by forming the gradient V = -V^, 
and that complies with the condition of irrotationality due to V x V = V x 
= 0. Introducing the velocity potential into the conservation equations 
yields a formula for the velocity potential ^ at compressible, unsteady flow: 

\ J ox^ \ ) oy^ \ J oz^ 

2uv 2vw 2wu 

cfl dxdy o? dydz a?' dzdx 

_J_^ 2w 

o?" dt'^ ^ o? dxdt dydt d^ dzdt ’ 

With a = oo, the Laplace equation is yielded for incompressible flow that 
serves as a basis for the boundary element methods in chapter 3: 






^ 02 ^ ^ 2 ^ 

dx^ dy'^ dz^^ 



( 2 . 1 ) 



The investigations of the present work are restricted to steady flows so that 
d/dt = 0 can be set for all time-derivations: 



I d^ ) dx'^ I d^ ) dy‘^ I d^ ) dz^ 

^ ^ ^ (2.2) 

2uv 2vw 2wu _ 

d^ dxdy (j? dydz d^ dzdx 

The local sound velocity a is a function of -h - jV^f'^). 

The full potential equation (2.2) is the fundamental mathematical basis for 
the nonlinear integral equation methods in chapter 4. 




366 S. Wagner, A. Rottgermann 

2.2 Fundamental Principles of Potential Theory 



Conservative fields are completely described by a scalar potential In agree- 
ment with Hess & Smith [17] and Maskew [32], the potential is considered 
in this work as potential energy, i.e. the field V(X, F, Z) is yielded from the 
negative gradient of the potential 

V{X,Y,Z) = -V^ 



An important tool for the potential theory is Greenes third identity: 







dS. (2.3) 



Equation (2.3) is valid on the conditions that ^ and its partial first- and 
second-order derivations are steady inside Q and that the volume integral 
converges for an unlimited domain [26]. The velocity potential ^ of the full 
potential equation (2.2) satisfies these conditions so that equation (2.3) can 
be interpreted as a solution of these differential equations. This formula is 
the mathematical basis of the field panel method in chapter 4. 

In the following it is assumed that the velocity potential ^ is a harmonic 
function in the whole domain i7, i.e. = 0 is valid for the Laplace operator. 
This reduces equation (2.3) to: 




Greenes third theorem can also be applied to multi-coherent domains. For 
instance in Figure 2.1 the space consists of the domains i?a exterior and Qi 
interior of S. 




Fig. 2,1. Outer Domain Problem 
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In these domains the velocity potentials 2 tnd are to exist. Greenes 
theorem is applied to both domains. Taking into account that the normals of 
both domains are of opposite directions ( 11 ^ = — n^) and adding both terms, 
the integral over the surface of the sphere E is obtained taking the limiting 
value R-¥ 00 : 

II '^(7) ]^ + <t>oo,P ■ 

(2.5) 

For the potential at an arbitrary point P € 5 [21], equation (2.5) for multi- 
coherent domains yields: 

= <t>oo,P + 2^^^ ~ 

+ ^ + (2.6) 

For harmonic functions equation (2.6) gives a solution for the potential at 
any point P. The equation is the basis for the boundary element method for 
the description of incompressible flows. 



3 Boundary Element Method 

If the velocity potential is a harmonic function in the whole space i? = 
Ha fii, it satisfles the Laplace differential equation (2.1): 

V‘^^ = 0. 

The velocity potential ^ can also be generated by superimposing the distur- 
bance potential (f) and the undisturbed initial flow potential 0oo ^ 

^ = (j) + (f)oo • (3-1) 

Since the potential of the undisturbed initial flow is only a function of the 
space, the following integral equation for the disturbance potential (f) is ob- 
tained: 

<t>P = \i<f>a-<l>i) + -^ II |-in„(V^„-V^i) + (<?^a-<^i)n„v(i)|ciS. 

(3.2) 

The solution of this potential equation can be interpreted as the superposition 
of a single and double layer potential. 
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3.1 Boundary Conditions 

In order to explicitly solve the integral equation (3.2) for a domain i7, poten- 
tial values have to be prescribed at the two boundary surfaces of the space 
in Figure 2.1. With increasing distance between the farfield boundary E and 
the surface «S, the disturbing influence of the body is getting less significant, 
the flow being undisturbed at the exterior boundary: 

lim ^ = lim (0 -h <^oo) = </>oo • 

r->oo r-4oo 

The potential in equation (3.2) is fixed, if the derivation of the potential 
and/or the potential itself is known on parts of the boundary or on the whole 
boundary S. In the following the relationship is valid: 11 ^ = n. 

Neumann: nV</> = -:^=t;n 
an 

Dirichlet : < t > = ^ 

In case of the Neumann problem, a value Vn is demanded for the normal 
derivation of the potential on the boundary. The Dirichlet-boundary condi- 
tion stipulates a value ^ for the potential on the boundary. For each of the 
two boundary value problems, the fulfillment of the boundary condition can 
be requested at the interior or exterior side of the border. 

In this work, the theoretical presentations will be restricted to the two 
subsequent formulations: 

1. Simultaneous use of single and double layer potential on the body surface. 
To obtain a definite system of linear equations, the additional conditions 
Ti+i = TNj,-i and /xi+i = -iJLNT-i (^ = Oj Ij • • • j Nt/ 2 - 1) are introduced. 
The system of equations is solved by using an external Neumann bound- 
ary condition. 

2. Use of an unknown doublet distribution on the contour surface together 
with a source distribution, on which the singularity strength is fixed at 

= 0 by an inner Dirichlet boundary condition. 

Since the surface of the investigated body is to be closed, fluid can neither 
enter nor leave through the border, i.e. the body is a flow surface and the 
kinematic flow condition has to be satisfied: nV^ = 0. Since the wake layer 
is assumed to be a flow surface, no fluid can pass through it; the jump in the 
normal derivation of the velocity over the vortex surface has to disappear: 
n • (V^o — V^tx) = 0- The jump in potential between upper and lower side of 
the vortex layer is the doublet strength of the wake: ixw := -(^o - ^u)- In 
order to simplify this notation, the factor ~ is taken into consideration for 
the singularity strengths. 
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Dirichlet 
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With these singularity strengths the basic equations of the boundary element 
method are obtained for the two boundary conditions . 



Neumann 



Dirichlet : 



n V jj T^dS -2'iTTp - nV jj fi n dS 
s s 

-nvjj dW = -nV (f>oo 

w 

// r^dS-2irnp- jj dS 
s s 

- jj /xvrnV^i^ dW = 0 



(3.3) 



(3.4) 



3.2 Discretisation of the Integral Equations 

In general, an analytic solution of equations (3.3) and (3.4) on the complete 
surface of arbitrary body geometries is not possible. For an approximate 
solution either the Galerkin method can be resumed, satisfying the basic 
equation on parts of the surface in a weighted mean sense, or the equation 
can be satisfied exactly by a collocation at a finite number of discrete points 
— the so-called collocation points or control points. All panel methods used 
in this work for the description of the incompressible fiow for two-dimensional 
profiles, three-dimensional translatory and rotatory fiows use the collocation 
method. 

For the numerical solution of equation (3.2), the integral equations (3.3) 
and (3.4) have to be discretised. The body surface is divided into individual 
elements, so-called panels, of which the end or corner points are located 
on the original contour. Since the determination of the infiuence coefficients 
for panel surfaces is much simpler than that for the curved elements, a plane 
compensation surface, where the collocation points are located, is constructed 
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for the individual surface elements [32]. In the following, equations are derived 
in case that constant source and doublet distributions are used piecewise on 
the plane panels. 

In this paper, only the discretisation for the Dirichlet boundary problem 
is indicated. The corresponding formulations for the integral equations (3.3) 
of the Neumann problem are given in [34, 36]. 



Dirichlet : 



i^k 

^ fiw, I j tiW,- - 0 (3.5) 



Since for the singularity densities Ti and /ij constant distributions are set 
over the individual panels, singularity strengths can be moved in front of 
the integrals. The following abbreviations are defined for the integrals of the 
kernel functions: 



Dirichlet : 




(3.6) 



The eigen-infiuences of the panels {i = k and j = k respectively) are included 
in the source and doublet matrices B and C. Then the discretised equations 
can be expressed for both boundary conditions in a unified way: 

N N Nw 

+ + = 0 k = l,...,N. (3.7) 

i=l i=l j=l 



Term (3.7) forms a system of linear equations with N equations for the deter- 
mination of the unknown singularity distributions. In this presentation, the 
infiuence coefficient matrices B,C and contain different values for the 
Neumann and Dirichlet boundary problem. In case of the Neumann problem, 
the matrix elements represent the normal components of the velocity influ- 
ences, whereas in case of the Dirichlet problem, the standard influences of 
the velocity potential are stored in the matrices. 

Putting all values known in the relevant time step to the right-hand side 
5 delds: 
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N Nw 

Dirichlet : = - 5] TjBfci - ^ fiw^ . 

i-1 j=l 

Thus, the system of equations to be established in the program are: 

N 

Dirichlet: '^mCki = 1lk k = l,...,N. (3.8) 

i=l 

In matrix notation this results in the formula A - fjL = TZ: 
ai,i ai,2 ••• ai,N x\ TZ\ 

02.1 02,2 • • • 02, AT X‘2 1^2 

03.1 03^2 • • • 03^iV ^3 zz ^3 

OiV,l OiV,2 * • • OiV,iV J L J L 

The elements of the influence coefficient matrix^ A correspond to the coeflS- 
cients of the matrix C from equation (3.6). 

If the singularity strengths of all panels are known after the solution of 
the system of linear equations (3.9), the velocity can either be determined 
at any arbitrary point P in space from the negative gradient of the velocity 
potential 

N N Nw 

+ ^l^iCki + -f- </>oo , (3.10) 

i=l i=l j=l 

or be calculated directly from: 

N N Nw 

Vp = -Vp<?P - Yl^iBki + + Voc . (3.11) 

i=l j=l 

The matrices for the velocity influences are defined as follows [32]: 




Wi 



® AIC — Aerod 3 Tiainic Influence Coefficient. 
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3.3 Vortex— Lattice Method 

3.3.1 Doublet Distribution and Vortex-Lattice. In the special case 
of a constant singularity strength /jLs, the doublet distribution, according to 
B. Hunt [20], is equivalent to a vortex ring of the strength F= fit = constant 
along the boundary of surface (Figure 3.1). 




Fig. 3.1. Equivalence between doublet panel and vortex ring 



In the vortex-lattice method, the wake layers are composed of adjacent, 
closed vortex rings. The circulation of individual vortex pieces results form 
the differences of the doublet strengths of adjoining rings. All wake elements 
consist of straight finite vortex filaments. In case of a straight, finite vortex 
filament of the length AS, the velocity induction is given by: 

r 

V = - — (cosai - cosa 2 ) . (3.12) 

47rr 

For r ^ 0 the velocity field has a singular point. According to equation (3.12), 
this results in induced velocities already in its surrounding which yield unre- 
alistic high amounts and directions. Therefore, different corrective procedures 
have to be introduced in order to correctly determine the velocities in the 
nearfield of a vortex filament. The effects of the different nearfield meth- 
ods on the velocity field and the shape of the vortex layers were extensively 
investigated and described by Urban [42] and Behr et. al. [5]. 

3.3.2 Discretisation of the Vortex Layer and its Development with 
Time. In this chapter, the vortex layer procedure ROVLM^ is discussed, that 
was developed by Urban [42], Wagner et. al. [47] and Behr & Wagner [1, 2, 
3, 4] for delta wings with leading edge separation and first used by Kramer 
[27], then by Rottgerman et. al. [33, 35] for the aerodynamic simulation of 
multi-blade rotors. 

The vortex-lattice method describes the development of the free wake 
vortex layer in case of an impulsive start of a wing or a multi-blade rotor. 
The developed method deals with thick wings with round leading edge and 
thus allows a separation of the vortex layer only at the trailing edges. 

At the beginning of the calculation, blade and airfoil are given without 
wake, i.e. the number of the wake panels is Nw = 0. As basic solution (t = 0), 
the system of equations (3.8) is solved with the right-hand sides being formed 



^ ROVLM — ROtor Vortex Lattice Method. 
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only from the normal components of the velocities of the undisturbed initial 
flow. For rotors, the initial flow consists of the translatory velocity Vq© and 
a component 17 x r from the rotation of the blade: 

1 ^ 

Tlk{t = 0) = [nfe(t) • (Voo,fc(t) + n{t) X Vkit))] Bki . (3.13) 

^ i=l 

These right-hand sides yield the unknown distributions Tk and [ik of the sur- 
face panels. With the singularity strengths of the starting solution, equation 
(3.11) yields the velocities at the points of the separating lines. 

The position of the wake points I shown in Figure 3.2a and 3.2b in the 
first row of the separated wake vortex layer result from the multiplication of 
the total velocities at the separating points with the time step At. 

Ti{t - 1) =VHK,l{t - 0) 

+ (Voo.j(* = 0) + n{t = 0) X VHK,l(t = 0) + = 0)) • At 

(3.14) 

The that way resulting points, together with the corner points of the panels 
on the trailing and lateral edges, form the edges of the vortex rings, from 
which the first row of the wake vortex layer is developed (Figure 3.2b). 

From the Kutta condition follows that the doublet distribution has to be 
steady at an edge, since any unsteadiness in /x would cause a discrete vortex, 
resulting in flow around the edge. To avoid that a vortex is originated at the 
trailing edge, the total vortex strength at the trailing edge has to disappear. 
The size of the circulation Fi = jiWj of the newly developed vortex rings is 
determined from the difference of the doublet strengths at the trailing edge 
panel of the sheet j in Figure 3.3: 

t^Wj f^HKaJ ~ ~ i,4^00HKa,j ~ ' (3.15) 

In the next time step, the newly developed wake row influences the total 
velocity at the control points of the blades which requires modification of the 
right-hand sides: 

j N Nw 

E • (Voo.fc(t ) + m X Vkit))] Bki . 

i=l 

(3.16) 

The coefficients of the matrices and are computed every time for 
each time step since the wake geometry is constantly changing during the 
iteration process, which is caused by the interferences between the wings and 
the vortex layers. 

By solving the system of linear equations with the modified right-hand 
side, the new strengths of the singularity Tk^lJik ^tre determined in time step 
t -h At. The new coordinates of the corner points of all wake panels {I = 
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Fig, 3,2, Developement of wake structure 



1, . . . , N\y) are yielded from the induced velocities of the airfoil panels Vf 
the vortex layer and the initial flow: 

Ti{t + At) = Ti{t) + (Voo,J W + fi(t) X Ti{t) + Vf ■ At . 

(3.17) 

where; 

N N Nw{t) 

= Yi + E = Y • 

i=\ i=l j=l 

The number of wake points Nw is increasing in each time step At. The 
elements of the coefficient matrices B and C, representing the influence of the 
wing on the wake, depend on the geometry of the vortex layer and therefore 
like the matrix have to be calculated every time in each time step. 

In the following time steps (Figs. 3.2c to e), the velocity vectors are deter- 
mined at all existing wake points with a new wake row arising in each case. 
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Fig, 3.3, Doublet strength at the trailing edge of the profile 



Since the position of the wake layer is determined from the induced velocities 
of the total system in each time step, the wake is always free of forces. The 
doublet strengths of already existing vortex rings remain constant and are 
transported throughout the wake. The shaded wake element in Figure 3.2 
indicates, how the circulation is pushed throughout the vortex layer. 

The wake vortex layers arise from a series of quasi-steady solutions of the 
potential equation. This procedure is being continued, until the circulation or 
drag distribution reaches an approximately steady or cyclic steady state. A 
more detailed description of the development of the wake vortex layer for thin 
wings with leading edge separation is documented by Behr et. al. [1, 3, 4, 6] 
and for thin rotor blades by Rottgermann [33] and Rottgermann et. al. [35]. 

3.4 Panel Clustering Method 

Computing the flow and the aerodynamic coefficients around three-dimensio- 
nal bodies (e.g. wings) by using a boundary element method, requires to di- 
vide the surface of the configuration into individual panels. In case of the 
chosen method, the discretisation has to be refined to solve high velocity gra- 
dients. Here, the orders of magnitude are quickly rising up to some thousand 
panels. Since all surface elements influence all other panels, the influence co- 
efficients matrices (3.9) are getting very large. Computation of the influence 
coefficients, their storage, as well as inversion of the matrices require very 
much computation time and storage capacity. 

The panel clustering method, developed by Hackbush h Nowak [10, 11] 
and further developed by Sauter [37, 38], offers a possibility of reducing these 
expenses. 

The basic idea of the method is to combine panels tt with an adequate big 
distance from the collocation point in clusters r. A detailed description 
of the panel clustering method is presented in [34, 36]. 



4 Field Panel Method 

Coupling of a boundary element method and a field method, e.g. finite- 
difference method yielded the program ROFPM^, facilitating the computa- 
tion of compressible flows without generating a field grid that has to be 



5 



ROFPM — ROtor Field Panel Method. 
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adapted to the contour. The sole task of the panel method in this compound 
is to fulfill the Laplace equation and the boundary condition on the body; 
the field method is only used to capture the compressibility. 

The idea is to divide the nonlinear differential equation (4.1) into a ho- 
mogeneous and a particular part. The homogeneous, linear part is treated 
by using the well-known panel methods, the nonlinear residual operator is 
solved by appl)dng the field panel method. 



4.1 Basic Equations 



The non-conservative potential equation (2.2) is transformed so that all 
derivations with a coefficient are put on one side of the equation: 



^ ^ ^ 2ut; ^ 2vw , 2wu 

^xx + ^yy + ^zz ~ o "*■ ^ • 

u, u, Cl (X 

For the right-hand side the abbreviation: 



(T ^2 ^XX + ^2 ^yy + -^^zz + ~^^xy 



, 2vw , 2wu ^ 

+ —^^yz + —^^ZX ■ (4.2) 



is introduced; equation (4.2) is a non-conservative formulation of the right- 
hand side the term a is used for the right-hand side, the complete potential 
equation is obtained: 

V^# = <7. (4.3) 

The value cr is a function of the derivations of the total velocity potential 
<^. The complete potential equation (4.3) is a nonlinear partial differential 
equation, that generally has to be solved iteratively. 

In an iteration procedure computing the right-hand side from the poten- 
tial values ^ of the preceding iteration step, cr can be considered as constant 
in the relevant iteration step: 



^ — constant . (4.4) 

Equation (4.4) is a Poisson equation with given constant right-hand side 
cr . This differential equation can be transformed with Green’s third identity 
(2.3) to an integral equation for the velocity potential. 

If the total potential ^ is reduced to a component for the undisturbed initial 
flow 0OO and a disturbance term equation (2.3) yields an integral equation 
for the disturbance potential </>: 



S S V 

The kernel of the volume integral in equation (4.5) corresponds to the kernel 
function of a single layer potential. This value a is described as a strength of 
the source distribution in the flow field or as a field source strength. 
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Splitting the surface integral into the components for the body S and the 
vortex layer >V, yields a nonlinear equation for the definition of the distur- 
bance potential: 




4.2 Boundary Conditions and Field Grid 

To include nonlinear effects, the volume internal in equation (4.6) hats to be 
evaluated. According to the break-down into a boundary element method 
and a field method, the disturbance potential is split up into the components 
from the panel method (f)^ and those of the field method <j)^\ 

4> = 4>P + 4,^ . (4.7) 

The kinematic boundary condition on the body surface is satisfied by the 
boundary element method so that at the exterior boundary of the field only 
the farfield boundary condition has to be satisfied for the disturbance poten- 
tial: 

lim (j) = lim (j)^ + lim = 0 . 

r-^oo r— ^oo r->oo 

As shown in chapter 3.1, the farfield boundary condition for (f>^ is satisfied 
implicitly by the singularity strength. Likewise, the condition Im = 0 is 

satisfied due to the 1/r - singularity of the field sources. 

Poisson equation (4.3) in connection with equation (4.7): 

yields with V^</>oo = 0 a condition for the disturbance potential inside the 
field grid: 

= (7 - . (4.8) 

Since the boundary conditions at the body and at the exterior boundary have 
already been satisfied, condition (4.8) is the only remaining condition to be 
satisfied explicitly. A Cartesian field grid can be used that can be generated 
without requiring much expenditure; it may penetrate the body arbitrarily. 
The grid boundaries run close to the contour. Since the farfield boundary 
condition is already satisfied at these boundaries, the grid size can be kept 
very small. 
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4.3 Iterative Solution of the System of Equations 

The nonlinear differential equation (4.4) is solved by an iterative procedure. 

Firstly, in iteration step n = l, the homogeneous part of differential equa- 
tion (2.1) is solved by the panel method described in chapter 3. From the 
kinematic flow condition nV</> = -nV</>oo together with equation (4.7) the 
relation is yielded: 

Thus the potential in the iterative step n is determined by: 

n = -n V<t>oo - n . (4.9) 

In the first iteration step n = 1, the field does not provide a contribution 
to the disturbance potential, i.e. = 0. From the boundary condition 

(4.9), a solution for the singularity strengths of the surface distributions of 
the body is obtained. These distributions yield the disturbance potential 
for all points in space. The components of the disturbance potentials of the 
panel method, the field and the initial flow together constitute the velocity 
potential: 

The distribution of the field source strength cr, according to equation (4.2), 
is a function of the potential ^ so that the values of a in iteration step n 
are determined from the values of the disturbance potential (j)^ of the same 
iteration step and from (f)^ of the preceding step: 

When the disturbance potential (j)^ and the field source strengths cr are 
known, a new solution for the disturbance potential (f)^ has to be calcu- 
lated. Equation (4.8) yields the rule for calculating the disturbance potential 
of the field: 

= (t'^ - • 

In the following iteration step, the field potential (j>^ modifies the boundary 
condition in equation (4.9). Thus, the contribution of the field to the total 
potential is determined independently of the boundary element method and 
iteratively coupled with the panel method via the boundary condition. The 
iteration process is being continued until the difference from the Laplace 
operator for the velocity potential and field source strength is below a given 
limit e: 

<e. 

Field values {<f>^ , a) or values of the boundary element methods such as (f)^ , 
Tk and fjLk are considered as convergence criteria. Since the present method 
captures the compressible influence over the discretised external space and 
therefore requires an evaluation of the field source strength in each iteration 
step, the program uses the modification of cr as break-off criterion: 
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max jcr" - ct" ^ | < e <7 ■ (4-10) 

i 

Typically, values in the domain Sa- — 0.01 ... 0.1 are used as convergence 
criteria. 



4.4 Discretisation 



To facilitate the computation of value a of the field source strengths, the total 
velocity has to be known for all field control points k. The contributions of 
the body and the vortex wake are determined in analogy to equations (3.11). 
The contribution of the field to this velocity is: 

= = (4.11) 

V 

After exchanging differentiation and integration, the derivations of the kernel 
function have to be integrated over V. The space surrounding the contour is 
divided into M grid cells covered by a constant source distribution in space 
of the strength cr. The whole cell has the value of a that results in the center 
of the volume element. The velocity at a control point k is obtained from the 
partial inductions of all field panels M: 



V 



FF _ 
k - 




(4.12) 



The velocities for two-dimensional plane field panels are determined from an 
analogous equation. The following kernel functions have to be developed for 
the individual velocity components: 



^ V* 

y. 

^ Vi 

Equations (4.15) for the induced velocities 
coefficient matrices of the field sources: 



M 

dx dy dz = ^ 5 

i=l 


(4.13) 


M 




dx dydz = ’ 

i=l 


(4.14) 


M 




dx dy dz = ^ • 


(4.15) 



result in the standard influence 



Dki = 




(4.16) 
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Together with the influence coefficient matrices of the boundary element 
method from chapter 3.2, all integrals of equation (4.6) are now available 
in discretised form. 

When establishing the iteration scheme, it has to be noted that all three 
components — body, wake and field — are influencing each other and also 
themselves. Thus, in comparison to the panel method, additional interactions 
ocur between field and body and also between field and wake. In Figure 4.1 
these interactions and the pertaining matrices are presented. 




After the grid generation, at first the influence matrices are calculated, of 
which the coefl[icients are not changing during the iteration process and can 
therefore be stored. 

The beginning of the iteration process presents the solution of the ho- 
mogeneous, system of linear equations for the panel method. By using the 
methods from section 3, the relevant singularity strengths of the surface dis- 
tributions are obtained. 

The total velocity required for the determination of the field distri- 
bution a in iteration step n at a point in space k consists of the components 
of the undisturbed free stream and the induced velocities of the body, the 
wake, and the field panel: 



N 



N 



Nw 



M 



i=l i=\ j=\ t=l 

Introducing the abbreviations: 

N N 

yr = E + E ’ (4-17) 



i=l 

Nw 



i=l 



vr=Ew,e»T. 



(4.18) 
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M 

= (4.19) 

i=l 

yields the equation for the field velocity: 

vf” = + 12" X + vf + vf + Vf ^'"-1 . (4.20) 

In the first step (n = 1) the field does not contribute to the total velocity since 
^n=o _Q described as starting value of the field source strength. In 

the following, the iteration index n is omitted to simplify the formulation, if 
all values of the variable come from the same iteration step. 

If the velocities in the field — and thus also the source strengths a of the 
field panel — are known, equation (4.19) yields the velocities inducing 
the field at the field control points. 

From the known spatial singularity distribution it is possible to compute 
the infiuences of the field on the body and on the wake. The velocity at 
the wake points I calculated with equation (3.17) have to be extended by the 
induced velocity from the field at the wake points: 

vr = Voo,/ + r? X r, -k Vf ^ 4- Vf ^ 

with the components defined as follows: 

N N Nw M 

vr =5^T.B„+5;ft4, vr=Y.”A- 

i—1 i=l j=l i=l 

(4.21) 

If the wake shape is allowed to change during the iteration, the infiuence 
matrices have to be modified for each iteration step. The new location of the 
corner points of the wake panels are computed equivalent to equation (3.17): 

rf+i = rf + Vj^’" • At . 

The field hats also an infiuence on the velocity at the collocation points: 

M 

= Y.(TiDki. 

i=l 

These induced velocities can be interpreted as an additional free stream ve- 
locity and will be accounted for in the right-hand sides of the system of 
equations. Therefore, the term in equation (3.16) has to be modified: 






1 ^ 

-si: 



M 



njb • ( Voo,/k -f ^2 X Fife 4- ^ (^iDki 



i=l 



Nw 
Bki 

i=i 



W,n 



(4.22) 

The new right-hand sides change the singularity distribution on the surface 
of the body which also results in new velocities in the outer domain. After ad- 
justing the strengths a of the field sources, new values of the right-hand sides 
TZ can be determined. This procedure is being continued until the strengths 
of the field sources fulfill the convergence criterion (4.10). 
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4.5 Calculation of the Field Source Strengths 

The strengths a of the field sources are determined from the velocities at the 
field control points which are calculated from the gradient of the potential 




Writing equation (4.2) with the derivatives of the velocities, the strengths g of 
the field sources are determined from the velocities and their first derivatives: 

V? 2uv 2vw 2wu , . 



4.6 Artificial Viscosity 



Since the full potential equation is obtained on the assumption of an adiabatic 
and irrotational flow, it follows from the energy equation that the flow about a 
contour has to be isentropic. The artificial viscosity T stabilizes the solution: 
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(4.24) 



The artificial viscosity T is added to the strengths cr of the filed sources at 
supersonic points: 

= ^ij,k + '^i,j,k • 

Using this modified Poisson equation (4.8) in the iterative procedure it is 
possible to treat flow problems with weak shocks. Test cases have shown that 
the artificial viscosity can stabilize the solution as long as the Mach numbers 
in front of the shock are Ma < 1.3. At higher velocities, the flow can no longer 
be considered to be isentropic. Therefore, the full potential equation is not 
suitable for flows with strong compression shocks. 



4.7 Coupling of the Boundary-Element and the Field Method 

The vortex-lattice method for rotors (ROVLM) mentioned in chapter 3 is 
coupled with the field panel method (FPM) to calculate compressibility ef- 
fects at the rotor blade tips. 

The combined code includes two iteration loops — one is responsible for 
the calculation of the wake shape, and the second is introduced, to correct 
the compressibility of the flow. Building up the vortex-lattice for circulatory 
flows requires about 50 up to 100 time steps. The nonlinear potential equation 
has to be completely iterated for each time step. 

To limit the computational effort of the coupled method, Kandil & Yates 
[23, 25], propose a procedure where the wake iteration is computed with fixed 
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field source strengths a and the compressibility iteration is performed with 
fixed circulation. This procedure continues until the strengths of the field 
sources and the doublet distribution in the wake are converged. 

Firstly in an incompressible iteration a force free wake is built up for 
a steady blade solution. Gennaretti et. al. [9] and Caradonna & Tung [7] 
have proven in a numerical and an experimental form that the influence of 
compressibility on the geometry of the wake shape is very limited and can 
be neglected. Therefore the wake is treated as a prescribed wake during the 
compressibility iteration. The singularity strengths on the surface S change 
during the iteration steps, just as the doublet strengths [xhk at the trailing 
edge of the blade. The new wake doublet strength = I^hKo ~ I^hKu is 
shed into the wake and one obtains a solution which just differs slightly from 
the exact solution. Since only the doublet strengths of the wake are modified 
but the wake geometry remains unchanged, the wake isn’t anymore force 
free. This could be corrected by a few iteration steps to remodel the vortex 
sheet. The resulting vortices at the trailing edge are very small so that this 
modification of the wake shape is neglected in this method. 

A further advantage of the prescribed wake geometry is that one does not 
have to calculate the influences and the influence coefficients of the blades, of 
the field grid and of the vortex sheet on the corner points of the wake vortex 
rings. 



4.8 Reduction of Computational Costs 

Integral equation methods can compute boundary values on the body’s sur- 
face S very efficiently, but need a lot of computation time and storage when 
calculating velocities in the outer domain V. 

The major part of cpu-time and storage requirement is used for the 

— calculation of the influences between the body and the field 

— calculation of the influences between the field and the field 

— calculation of the influences between the field and the body 

In order to limit the computational effort, it is necessary to couple the field 
panel method with finite-difference methods which are more efficient in cal- 
culating stream values in the outer domain than integral methods. Therefore, 
an AF-scheme, a finite-difference method for calculating the velocities in the 
outer domain and a trilinear interpolation for calculating the velocities at 
the collocation points were used [34]. The reduction in cpu-time and storage 
requirement is shown in Figure 4.2. 



5 Results of Rotor Flows 

The theoretical background described in the previous chapters has been re- 
alized in a two-dimensional and a three-dimensional code. 
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Fig. 4.2. Reduction of cpu-time and storage [34] 



In the present paper, only incompressible and compressible calculations 
of the three-dimensional code are presented. These calculations are limited 
to steady flows of a hovering rotor. 

To validate the program ROFPM, several test cases of the model rotor of 
Caradonna & Tung [7] were calculated. The CARADONNA-Rotor consists of 
two untwisted and untapered blades with NACA 0012-sections (aspect ratio 
A = 6). Since the program should investigate the blade-vortex interference, 
the discretisation has to be refined at the blade tip. Although the root vortex 
influences the velocity distribution in the inflow plane of the rotor disk [48], 
it has no eflFect on the pressure distribution at the tip. For this reason, the 
discretisation at the root is not refined so the total number of surface panels 
can be kept small. In Figure 5.1 a blade of the CARADONNA-rotor [7] is 
presented. 



y/c 



1 . 0 '- 

x/c 



Fig. 5.1. Discretisation of the CARADONNA-testrotor 



Both blades are discretised into N = 360 surface panels with 12 strips of 
30 panels each in chordwise direction. The panel edges are located at 16.7, 
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33.3, 45, 55, 64.2, 71.8, 77.5, 82.5, 87.2, 90.8, 94.5 and 97.5 % of the blade 
radius. 

Figure 5.2 shows the vortex sheets emanating from the trailing edges of the 
blades for a collective pitch angle, = The vortex sheets are calculated 
by the incompressible vortex-lattice method ROVLM [33, 35], see section 

3.3. 




Fig. 5.2. Vortex sheets of CARADONNA-rotor (0^ = 8”) 



The blades rotate by an azimuthal increment = 15° in each iteration 
cycle. The emanating vortex sheet builds a tip vortex, whereas the root vortex 
cannot be simulated because of the coarse discretisation at the root. The wake 
of the first revolution melts to the so-called starting vortex [33] of a diameter 
larger than the rotor disc. If rotor and wake produce enough induced velocity, 
the starting vortex declines and the following vortex sheet forms a helical 
surface which can be assigned to each blade from which it is emanated, see 
Figure 5.2. 

The cartesian field grid shown in Figure 5.3 contains 24 cells in chordwise 
direction, 14 cells in spanwise direction and 16 elements in the direction of 
the rotor axis (M = 5376 field cells). 

The side view in Figure 5.3 illustrates that the field grid does not have 
to be adapted to the body and may penetrate the contour at an arbitrary 
position. Due to the collective pitch, areas of nonlinear flow are expected 
to appear on the upper side of the blades. Therefore, the grid is shifted 
upwards {zmin/c = ~{}.2b,Zmax/c = 0M). The top view in the same Figure 
shows that it is suflScient to stretch the boundaries of the field grid 0.1 chord 
length in front of the leading edge and behind the trailing edge {xmin/c = 
— 0.1, Xmax/c= 1.1). In hover, nonlinear effects are due to occur in the tip 
region. The spatial field is restricted to a relatively small area at the tip of 
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the blade (ymin/c=4.0,yrnaa;/c=6.3). The three-dimensional view illustrates 
the extension of the field grid and the blade. 



Back-View 






Fig. 5.3. Field grid at the tip of blade 
(24 X 14 X 16 fieldpanels, -0.1/1. l;4.0/6.3; -0.25/0.65) 



The grid shown in Figure 5.2 and 5.3 was used for all calculations con- 
cerning the Caradonna test rotor. 

The wake structure of the incompressible vortex-lattice method is used 
as prescribed wake in the compressible iteration. During each iteration cycle 
the shape is kept constant, and the calculation of the induced velocities at 
the wake corner points can be neglected. 

The first test case is a lifting hovering rotor with a tip mach number 
Ma^ip = 0.439 and a collective pitch setting of ©c = 8°. The comparison 
between the calculated pressure distribution Cp of the field panel method and 
an Euler code with measured data is presented in Figure 5.4 at different radial 
positions. 

Although this is a subsonic catse, compressible effects are clearly visi- 
ble. When comparing the incompressible vortex-lattice method and the field 
panel method, it can be seen that the pressure distribution on the upper side 
of the blade is underestimated by the vortex-lattice method. The field panel 
method is in good agreement with measurement up to the suction peak, which 
is underestimated. A reason could be that the discretisation of the leading 
edge is too coarse to resolve the pressure gradient. The Euler code also under- 
estimates the suction peak^. Compressibility effects occur at a radial position 
greater than r/R^ 0.70. The field panel method corrects the incompressible 

® Equivalent results for Euler codes can be found as well by Hertel [15], Hertel 

et. al. [12, 13, 14], Kramer et al. [28, 29], Kramer [30] and Stangl et. al. [40]. 
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Fig. 5.4. Pressure distribution of the CARADONNA-rotor for 0c =8°, Matip = 0.439 



vortex-lattice results at the outer radial positions, whereas at the inner sec- 
tions the compressible iteration of the field panel method has no remarkable 
infiuence on the results. The comparison of all pressure distributions shows 
that the field panel method overestimates the pressure on the lower side of 
the blades. 

To prove that the field panel method allows to predict transonic fiow, 
the CARADONNA-rotor was calculated for the collective pitch &c = 8° and 
the tip mach number Ma^^p = 0.794. Near the tip of the blades, supersonic 
regions with shock waves appear. The pressure distribution Cp at specified 
radial positions is plotted in Figure 5.5. 

The results of ROFPM are in good agreement with the measured data. 
In particular, the magnitude and location of the shock are represented very 
well. At the outer most radial position, (r/i? = 0.96) the suction peak on 
the upper side is underestimated. Likewise a difference in the suction peak 
occurs at the radial position r/R = 0.80. The fact the Euler method and the 
field panel method underestimate the suction peak, could be a hint, that the 
measured data is wrong^. Comparison with the Euler solutions of Kramer 

^ See Kramer [30] for a detailed discussion of the measured data. 
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Fig. 5.5. Pressure distribution of the CARADONNA-rotor for 0c =8°, Ma^jp = 0.794 



[30] shows better agreement of the field panel method with measured data in 
pressure distribution at the leading edge and prediction of the shock at the 
blade tip. 

The residuum of the field panel method needs 18 cycles of the compress- 
ible iteration to reach the convergence criterion =0.01. The program needs 
90 minutes cpu-time for these 18 cycles on an IBM RISC/6000-320. Adding 
the cpu-time to determine the wake structure, we need 135 minutes, corre- 
sponding to 5% of the computation time for the Euler solution of Kramer 
[30] on the same machine. 



6 Conclusions and Outlook 

A field panel method has been developed, which takes into account com- 
pressible, transonic effects in panel methods which have been developed for 
linearized potential equation. Basis of the nonlinear extension is the steady 
full potential equation which is formulated as a Poisson equation = a. 
The right-hand side of the Poisson equation includes all nonlinear terms of 
the potential equation and is interpreted as the singularity strengths of a 
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field source distribution. The Poisson equation is solved by an iterative pro- 
cedure, where the Laplace operator is solved with a linear panel method and 
the influence of the field sources a is included in the right-hand side Tt of the 
linear system of equations. 

The field panel method allows to treat two-dimensional and three-dimen- 
sional flow problems. In case of three-dimensional calculations, ROVLM is 
used as the incompressible boundary element method which calculates the 
free wake of translatory or rotational flows. The extensions for the consider- 
ation of the compressible effects are included in ROVLM. 

Several advantages of this method are listed below: 

o The field panel method uses a Cartesian grid which does not have to be 
surface fitted, thus avoiding the major task of grid generation, 
o The boundary conditions are fulfilled by the panel method alone so that 
the grid is used only to calculate the source strengths. In practical cases, 
the field source strengths, which represent the compressibility, decay away 
from the contour so that the computational grids can be restricted to small 
regions, where the nonlinear effects cannot be neglected, 
o The integral equation method can be combined with other finite-difference 
methods to reduce computation time and storage requirement. 

The method is limited due to the theory and to numerical simplifications: 

o The unknown singularity strengths of the boundary element method are 
taken as constant distribution over the panels 
o To reduce storage requirement and computation time for the calculation of 
the influence coefficients of the field, the field grid is modeled by a Cartesian 
grid with equidistant length in all directions, 
o Up to now, it has not been possible to treat unsteady flows. 

A rotor encounters transonic effects at the advancing blade and at the 
tip. Since there are only subsonic velocities or weak shocks at a rotor, the full 
potential equation is suitable for treating helicopter flow problems. Since the 
basis of this method is the steady full potential equation, the investigation is 
restricted to steady hover flight cases. The field grid is limited to the tip re- 
gion, but computes also the influence on the neighboring subsonic regions. At 
the root and in the major part of the surroundings the flow can be considered 
to be incompressible so that no field grid is required. 

The ROFPM code was tested with the Caradonna test rotor for sub- 
sonic and transonic cases. The results were compared with measurements 
from Caradonna & Tung [7] and with Euler code calculations Hertel [15] 
and Kramer [30] . Although the code W 2 ts developed as a preliminary design 
code, the results show the same quality as the solutions of the Euler code but 
requiered only 5% of the cpu-time used with the method by Kramer [30]. 

Since the program ROFPM is able to calculate the shape of the free 
wake, the code could be used to investigate the blade vortex interference 
as well as the influence of the body on the rotary flow. The possibility of 
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calculating complex geometries and interference with little effort makes the 
code a good tool for computing complete helicopter configurations in the 
preliminary design phase. 

Future developments of the ROFPM code and the field panel method 
could be in the following fields: 

o Extending of the program to unsteady forward flight using the unsteady full 
potential equation. Including flapping and pitching motion of the blades, 
o Investigating of blade-vortex interferences in the transonic tip region and 
analyzing the influence of the body on the rotor flow, 
o Using more efficient field methods to reduce storage requirements and cpu- 
time. 

o Modifying the panel clustering method to calculate the influence coeffi- 
cients of the wake elements in a more efficient way. 
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Interpolation, Triangulation and Numerical 
Integration on Closed Manifolds 
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1 Introduction 

The first part of the paper deals with the interpolation and triangulation 
of closed manifolds. Let T be a closed surface in which is implicitly 
given by a function H{x) = 0. We present a new algorithm which creates a 
“nearly” uniform triangulation of this manifold. Our ideas are based on the 
procedures by E.L. Allgower [1] and on local geometry properties [2]. The 
algorithm allows a parallel discretization and was tested for several examples. 

In the second part of the paper we study the numerical computation of 
weakly singular integrals. We develop an adaptive algorithm for the calcu- 
lation of a class of boundary integrals. Furthermore, a method is presented 
which allows the construction of -smooth boundary and finite elements 
on a given closed manifold T C . Finally, Hadamard’s regularization tech- 
nique is used to reduce the computation of singular boundary integrals to the 
computation of weakly singular integrals. 



2 The algorithm for the approximation of a closed 
surface in by triangulations 

Here we describe a parallel algorithm for eight transputers which yields a 
nearly uniform triangulation of a closed surface implicitly given by H{x) =0 
where x — {x\^ X 2 ,xs). We assume that for all corner points of the triangles 
there holds H{p'^) = 0. 

The following input data are required (see figure 2.1): 

- a surface T of a body i7 given hy H{x) =0, where H{x) <0 in the interior, 

H{x) >0 in the exterior of Q and V//(x) 0 in a vicinity of F ; 

- the number N of partitions; 

- a starting cube. 

The starting cube of edge length 2 1 must contain F completely (see fig- 
ure 2.2). This cube is divided into eight subcubes associated with the eight 
transputers. Each subcube is then divided into cubical subdomains. In 
the algorithm only the so-called suitable cubical subdomains are used, where 
suitable means that in the subcube there exist points p* with H{p^) > 0 and 
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Fig. 2.2. Partition of the surrounding cube with respect to the number of 

processors 
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Jp with H{jP) < 0. The points p* which are situated at the outer faces of a 
suitable cubical subdomain are projected in the direction VH{p^) onto the 
surface F (see figure 2.3). The following algorithm determines this projection: 




Fig. 2.3. Projection of the outer points onto the surface F, 



1st step: Define := and choose 

p~ := p^ + 1 VJT(p*) with t G E such that H{p~) < 0. 
2nd step: Calculate 



3rd step: If \H{p*)\ < e then p* := p* and proceed to the next point p^ 

4th step: If if(p*) > 0, then p+ := p*; otherwise p” := p* and return to 
step 2. 

In the examples below we chose £ = 10“®. 

Next, we divide the generated quadrangles on the surface into two trian- 
gles along the shortest diagonal. 

Unfortunately, in general these triangles are nonuniform. Therefore a cor- 
rection of the triangulation is necessary. We take for each point the arith- 
metical mean value of his adjacent neighboring points, replace the point by 
the mean value and map it back onto the surface. Sweeping twice through 
the surface points, we obtain a more uniform grid (figure 2.4). 

The subgrids generated by the eight transputers will be fit together. 
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Fig. 2.4. correction of the grid. 




2.1 Examples 

The following four examples are considered: 

1. the sphere: H{x) = xf -h + X3 - 1; 

2. the ellipsoidal surface: H{x) = H- \x\ +X3 - 1; 

3. the torus: H{x) = (xf +xi +x| + - {jfY - [xl xl)\ 

4. the double-sphere: H(x) = I , f L T ^2 T ^2 f ~ n’ 

\ (xi -f -I- xi -I- x| - i : xi < 0. 

For the starting cube we choose I = 1.0, and for all examples the number of 
partitions is chosen as TV = 20. The resulting meshwidth is S < 0.05. The 
generated triangulations are corrected three times. The computations were 
executed on a transputer-board with eight transputers T800 and with an 
accuracy of 10“^^. The results are demonstrated in the following table: 





number o: 
triangles 


points 


time/nui 

1 


nber proa 
2 


sssors; Sp 
4 


>eedup 

8 


sphere 
figure 2.5 


9072 


4538 


211 sec 
1,0 


129 sec 
1,6 




72 sec 
2,9 


ellipsoid 
figure 2.6 


6656 


3330 


145 sec 
1,0 


90 sec 
1,6 




51 sec 
2,8 


torus 
figure 2.7 


7120 


3560 


168 sec 
1.0 


103 sec 
1,6 


72 sec 
2,3 


mEi 


double-sphere 
figure 2.8 


6000 


2974 


127 sec 
1,0 


80 sec 
1,6 


57 sec 
2,2 





Remark: 

The results show that the speedup is small. The reason for that is that large 
parts of the algorithm are running in succession and the loadbalancing is not 
solved very well. To avoid these problems, we should use one processor as 
“master” where the main part of the algorithm is done and the remaining 
processors as “slaves” which do the necesarry computations. This is explained 
more detailed in [3]. 






























Fig. 2.T. torus Fig. 2.S. double-sphere 



2.2 Extension on surfaces with edges. 

The surfaces considered in the examples of the last section were smooth 
except the double-sphere, i. e. the gradient was defined at each point of the 
surface. In order to circumvent the problem caused by the non-existence of 
the gradient along the edge of the double-sphere, two “tricks” are applied. In 
this example it is possible to choose the starting cube in such a way that the 
curve of intersection of both spheres is situated in the common face of two 
adjacent subcubes. Furthermore, we define a “modified gradient”: 

{ (2xi - 2x2, 2x3) for xi > 10“^, 

(2xi -h |, 2x2, 2x3) for xi < -10“®, 

(0, 2x2, 2x3) otherwise. 

Clearly, this approach is only possible if we consider a grid where the curves 
of intersection of the composed bodies lie in a plane. 
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In addition we considered bodies with more general edges: 

1. A cylinder with edges (see figure 2.9). 

2. Two cones glued together (see figure 2.10). 

3. A triple torus and a satellite (see figures 2.11 and 2.12). 

4. A cube where three cylinders are cut out. This cube has the following 
geometrical singularities: 

a) comer points, 

b) edges at the exterior faces, and 

c) complicated edges in the interior of the cube. 

The difficulties occuring here can be classified as follows: 

1. The comer points will be cut by the algorithm and replaced by one 
triangle face. A modification is necessary where several triangles with 
the given corner point cis common vertex are used. 

2. The algorithm produces a fringed grid at the exterior edges. This can be 
avoided by an appropriate choice of the starting cube and modification 
of the gradient. 

3. Our approach does not work for edges which appear at non-plane inter- 
faces of composite bodies. 

4. The exact calculation of the gradient for surfaces of complicated structure 
is expensive, hence it should be approximated numerically. 

2.2.1 Numericad approximation of the gradient. We compute the 
gradient at the point approximately with the help of centered differences: 



V//(x°) 



H(x°+ee^)-H(x^-e eM \ 
2e 

H(x°+ee^)-H(x'>-e e^) 

2e 

, H(x°+ee^)-H(x°-e , 

\ 2s / 



where e* is the i-th unit vector in R® and s — 10~®. In order to decide wether 
the gradient is well-defined or has a jump at x“, we calculate the gradient at 
all points adjacent to x°: 



:= +ie^ + j + ke^ with i,j. A: € {-1,0, 1}. 



If 

and therefore 



Vi/(x‘-J’*) 0 and VF(x°) 0 



(VH(x«>*),Vif(xO)) 
|Vi/(x‘-l.*)| |Vif(xO)| 



> d with S = 0.8 for all x*’-'’*. 



the gradient is well-defined. If this inequality is not satisfied for at least one 
of the points x*’-^’*, then the gradient has a jump in the vicinity of x°. In this 
case x° is an edge or corner point of the surface F. 
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However, our algorithm needs a well-defined gradient. Therefore we re- 
place xo in the discontinuous case by a point close to on the initial triangle 
where the gradient exists, then this replaced point is projected onto the sur- 
face. 

We now assume that a triangulation with grid points on the surface F is 
available. As described before we correct the triangulation in order to obtain 
a more uniform grid. Unfortunately, in the neighborhood of edges and corners 
this method fails and a special correction for these geometrical singularities 
is necessary. 

2.2.2 Correction of the corner points. The corner points, which are 
known as such, have to be included into the grid. This is realized by replacing 
the grid point which is one of the closest to the corner point, by the corner 
point itself. Grid points which are corner points will be marked; and they are 
fixed in the mean value correction procedure described before. 

Applying this correction procedure we end up with a nearly uniform grid 
with correct corner points. 

2.2.3 Correction of the edges. The algorithm for the correction of the 
edges will be described in seven steps: 

1st step: Determine the triangular line (p^; p^) which satisfies 



\VH{p^)\\VH(p^)\ 



<5 for 6 = 0.8 ; 



i.e. one edge point lies between p^ and pi^. Points with VH{p) 0 are 
edge points which are marked and fixed in the following steps. 

2nd step: Define 

pI :=p^ and pg :=p^ 

3rd step: Determine the mid-point 



12 Pk+Pk 



Pk ■= 



and project p]^^ onto F such that 



H(Pk^) = 0 with pI^ := p\^ + XV . 



4th step: If 

{VH(pi^),VH(pi)) {VH(pi^),VH{pl)) 
lVi/(pi2)| \VH{pi)\ \VH(pi^)\ \VH{pl )\ ’ 

then put 

1 12 j 2 2 

Pk+i ■= Pk und Pk+i ;= Pk ; 

otherwise put 

Pk+i — Pk und pI+i :=pi^. 
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5th step: If 

\Pk+i -Pk+i\>E 

then return to step 3. (In the examples above we chose e = 10“®.) 

6th step: If 

\p^-pI^\<\p^-pI% 

then put 

:= p]? and mark p^ as an edge point; 

otherwise put 

p^ := p\^ and mark p^ as an edge point. 

7th step: If there is another triangular line with an edge point then start 
with step 1 again. 

The resulting grid (except the corner and edge points) is corrected anew 
by the mean value correction procedure described at the beginning of this 
section. The following examples show that the resulting triangulations are 
nearly uniform. 

2.3 Examples with edges 

1. cylinder: H{x) = max||x 3 | — 1, xf -l-xl — l| 

2. two cones: H{x) =max||x 3 | — 1, xj +X 2 — (1 ~ 

3. triple torus: ^(x) = (xf d-x^-l-xlH-l.O)^— 8max |xf-l-X 2 , X 2 +x\^ 

4. satellite: H{x) = min |/:iri(x), H 2 (x), Hs(x)^ with 





i min jxj + X 2 - 0.05, xf + x| + (|x 3 1 - 4)^ - j 

[ xf +xi + (|x3|-4)2-i 


[ if|x3|<3, 
else. 




i min |x 2 + — 0.05, x| + x| + ([xil — 4)^ — ^ j 

[ x| +x| + (|xi| -4)2 - i 


[ if|xi|<3, 
else. 


J^3(X) = | 


f min |xi + Xg - 0.05, xf + x| + (|x 2 | - 4)^ - | j 

[ xf +xi + (|x2|-4)2-i 


[ ifNI < 3, 
else. 


cube: H{x) = max|ifi(x), 7/2 (a:) | with 






Hi{x) = max||xi| - 1, |x 2 | - 1, |x 3 | - ij 
H 2 {x) =max|| -X? -Xa, ^ - xl - xl, ^ 


-4} 
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With the modified algorithm we obtained the following results: 





number of 


CPU-time for the mesh I 




triangles 


points 


generation 


correction 


cylinder (Fig. 2.9) 


2672 


1338 


12,0 sec 


140,0 sec 


two cones (Fig. 2.10) 


12752 


6378 


225,8 sec 


459,9 sec 


triple torus (Fig. 2.11) 


14976 


7452 


304,6 sec 


896,9 sec 


satellite (Fig. 2.12) 


19056 


9530 


495,3 sec 


6078,6 sec 


cube (Fig. 2.13) 


10032 


5008 


142,6 sec 


500,6 sec 



2.4 Convergence 

In this section we investigate the convergence of the algorithm for a polyhe- 
dral domain. To this end, we introduce barycentric coordinates: 

Definition 2.1. Let be n-\-l points which are not all situated on 

a hyperplane of . Then the values Ao(x), . . . , An(a:) € E associated with the 
point X with 



^Ai(x) = l and x — Aj(x)t>^ 

i=0 i=0 

are called the barycentric coordinates of the point x. 

The barycentric coordinates are defined by the points in E^ 

we need four points. 

Since we need the barycentric coordinates with respect to a plane triangle 
in E^ , we choose an auxiliary point vq for the fourth barycentric coordinate 
in E^ . Then \o{x) = 0 for all x lying in the plane defined by the triangle. 
The corresponding procedure is given as follows: Choose the point e. g. 



0 4- 4 



4 






, 11 ’ 



calculate the \i{x): 





( 1 


1 


1 


1 


\ 






/ 


1 


1 


1 


1 


\ 


1 

:= — det 


X\ 


1 


v\ 

2 


v\ 

3 




, Ai(a:) 


1 , 

:= — det 




0 


Xi 


Vi 

2 


3 




D 


X2 


^2 


^2 


^»2 






D 




V2 


X2 


V2 


^2 






\ X2 


^2 


V2 


V2 


) 






[ 


V2 


X2 


V2 


V2 


) 




1 


1 


1 


1 


\ 






f 


1 


1 


1 


1 


\ 


1 , 

:= — det 


0 


v\ 

1 


X\ 


v\ 

3 




, A3(a;) 


1 , 

:= — det 




0 


1 


2 


Xl 




D 


V2 


^2 


X2 


^2 






D 




^2 


^2 


V'2 


X2 






\v! 




X2 


vl 


1 






[ 


V2 


^^2 


V2 


X2 


/ 



D = det 



1 


1 


1 


1 \ 


V? 

vl 




vl 




^2 


vl 


vl 




^2 


^2 


vl / 



where 
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Wllm 



MH 



0mK^v 



ff Irvi^i'i 












^^ifelPi 



Fig- 2.9. cylinder 



Fig- 2.10- two cones 



Fig, 2.12. satellite 



Fig- 2-11. triple toms 



Fig. 2.13. cube 
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Now we show the convergence of our algorithm: 

We assume that the surface F given by H{y) = 0 is approximated by 
a polyhedron Fh which consists of triangles {i = 1, . . . , AT) of maximal 
diameter 

h := max | |x - y| x^y £ A^, i = 1, . . . , iv| . 

Let us consider a triangle A^ which is spanned by the points and 

. At the same time these points generate a curvilinear triangle T* on the 
surface F. In short, we write 

A for A\ for (analogously, and v^) and i? for F\ 

Let y be an arbitrary point of f2 and y' its projection in the direction of 
gradient VH{y) onto A. Our aim is to show that 

|y-y'|->0 converges as h ^ 0. 

For any point x we have 

H{x) = H{y) + VH{y){x - y) + H{y*){x - y)\ 

where y* G fi. We assume that H is piecewise (7^(i7)-smooth. 

Then, for the points v^,. . . we have 

H{v*) = H(y) + yH{y){v* - y) + - y?. (2.1) 

With the help of the barycentric coordinates, the point y' £ A can be repre- 
sented as 

3 

y' = ^>^i(y')v\ 

i=l 

Insert this formula into (2.1) to obtain 
3 3 r 

'£\iiy')H{v^)=f^\iiy') H(y)+VHiy)iv^ -y) + ^D^ H{y-){v* -yf . 

i=l 

With H{v*) = H{y) = 0, this yields 

^ 1 
0 = E - J/) + 2 O' H{y*){v^ - yf 

i=l 

= VH{y){y' -y) + J2 \ O' H{y*)iv^ - y)\ 



( 2 . 2 ) 
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With the estimate 







-y\‘^D^H{y*) 



(v‘ - 

|V» - J /|2 




max < max 
1<*J<3 I a;€f2 




because of \v^ - y\ < h, we obtain from equation (2.2) the second order 
estimate 






< 



h'^ f 

— max < n; 
2 1<U<3 \x 



max 

xeo 



\ max 

z£f2 



9yi dyj 



(x) 



d^H 
dyi dy^ 

h 






h'^ 



M. (2.3) 



Since VH{y) is parallel to {y - y'), i.e. Vif(y) = c{y- y'), it follows that 
\^Hiy){y'-y)\ = \VH{y)\\y'-y\. 

Furthermore, since V/f(j/) 7^ 0 for all y e 12, equation (2.3) implies 



P.4) 

Hence, we have shown that the polyhedron converges to the surface P : 
H{y) = 0 foT h 0 quadratically. 



3 An adaptive method for the computation of surface 
integrals 



Our aim is to calculate integrals of the form 




First, a mesh Af C P is generated. Then a small positive number (5 > 0 
is chosen as a threshold which is given by the overall accuracy required. 
The mesh Af corresponds to a polyhedron V := {Aj}i<KM , where the A^, 
1 < i < M are created by the grid points of Af. The corner points of the 
polyhedron V are located on P. We use once more the projector Pr{x) := 
X 4- \{x)VH{x)^ which is described in section 2 Let 

'ji := Pr{Ai) for 1 <i < M. 

The Ji C P are obviously curved triangles. If the mesh Af C P is sufficiently 
fine, then the family becomes a triangulation of the surface P, 

i. e. we may require the following conditions: 
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1. For all 1 < j < M with i ^ the intersection 7 * fl jj is either an 

empty set or a common line or a common corner point, and 
M 

2. r c u li- 

i=l 

Under these conditions, we have 




ip da. 



The integrals 




if da 



will be evaluated separately. Therefore we introduce a local coordinate trans- 
formation 

y = Ax + b, 



in accordance with figure 3.1 




Fig. 3.1. transformation to local coordinates 

whereby the triangle is mapped into the coordinate plane 

( 2 / 15 ^ 2 )- For simplicity we denote the corner points of the triangle A{ by 
€ J\f. We further denote the transformed function of H hy H as 

well. Let 

:= Ax^ 4-6 for 1 < i < 3. 

Finally we introduce the projector Q : defined by 

Q{y\,yx,yz) ~ {y\,y‘i) for all {yi,y‘2,ys) e K®- 



Now let 

Wi Qiji), 
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yi 

Fig. 3 . 2 . projection of 7* onto the plane (yi,y2) 

i. e. Ui is the projection of ji into the plane (2/1, 2/2) as depicted in figure 3.2. 

Corresponding to the local transformation, the integral I{ji) is trans- 
formed cts well and replaced by: 

I(7i) = j i’(yi,y2) dyi dy2 « hiji) := ) + V’fa ) + V’fa ) |^_|^ 

where | A^l is the measure of the triangle Aj. The next, second approximation 
^2(71) will be computed according to the following refinement procedure: 

First, introduce four auxiliary triangles 

where the corner points and are determined as follows: 

1. For all 1 < fci < 3: 

yl-M ykt^ 

y‘ 2 ;ki Qo Pp j Tvitii n = {k + l) (mod 3), > (3.1) 

:= Q o j with /r = (iki + 2) (mod 3), 

and 

2 . 

;= QoPr(^n^y 




(3.2) 
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Fig. 3.3. refinement of Ui 



These four new triangles represent a triangulation of Ui as shown in figure 
3.3. We calculate 









and obtain the second approximation of the integral /( 7 i) by 



Hli) ~ E 



Next, we check the inequality 

|/i(7i)-/2(7i)l<<5 

with the given tolerance J > 0. If this inequality is satisfied then we choose 
^{li) •= Otherwise we compute the next approximation h(ni) of /(7i) 

and proceed with the next subdivision by replacing one triangle by four new 
ones, e.g. the triangle by 

> l<fc2<4. 

The corner points ^ ^ calculated by using (3.1) and (3.2), 

after replacing y^ by y^'^ and by 

Then one computes the numbers Is{Ai^i^k 2 ) with the help of formula (3.3) 
by modifying the indices correspondingly and obtain 

4 4 

hili)= /3(Ai,i,^J H- ^ 

/f2=l ki=2 



Now we check the inequality 
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Mli) - /3(Ti)l < <5. 

The further steps are obvious. 

It can be shown that the error is of the order O ) if ^ and of the 

order O )? if the function ip possesses a weak singularity, denotes 

the degree of refinement. 

With this algorithm, the integrals appearing in boundary element meth- 
ods can be computed numerically. Figure 3.4 shows the needed computational 
time depending on the number of original triangles on the surface F and fig- 
ure 3.5 shows the accuracy of the computation of the double layer potential 
on an ellipsoid where the singularity is situated on the surface. 




Fig. 3.4. CPU-time for the computation of the double layer potential on an 
ellipsoid with the singular point situated on the surface 



4 Boundary elements of the class and finite-element 
bases of the class C^. 

Again, we consider an implicitly given surface F := {a: G | /7(a:) = 0} and 
a mesh M = {x*}i<Kiv C F. We assume that M is dense enough in order 
to determine a triangulation of F which approximates F well enough. Let 
1 < f < AT be a fixed index. 

We introduce local coordinates {yi’>y 2 ^yz) with respect to ^ M by the 
transformation 

y = A{x — X*), for x G , 

where the matrix A is chosen such that the plane {yi,y 2 ) coincides with the 
tangential plane of T in xL A is a rotation and A~^ = A*. 
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Fig. 3.5. Accuracy for the computation of the double layer potential on an 
ellipsoid with the singular point situated on the surface 



Let Mi := C be the neighboring points of given in 

local coordinates by 

(yi^yi^vi) ~ - a;’) for 1 < j < P 

where 

‘= (yi.yi) 

denotes the projection of the grid point {yi^yi^yi) ^ ^ onto the tangential 
plane (yi,y 2 )- Without loss of generality we assume := (0,0). 

Now we consider the triangles 

Aj := [y^,y^,y^~^^] for 1 < i < P - 1 and Ap := [y^,y^,y^] 

P 

as shown in figure 4.1, together with the closed region uji := (J Aj. 

j=i 

Now we are in the position to formulate and to determine the desired inter- 
polation. 

For simplicity, we denote the function H after the transformation x i — > y 
again by the symbol H. Let 

1. h :ui — > E be the solution of 

H {yi , y-i , h{yi , j/ 2 )) = 0 for all {yi,yi) ewl 



= {(i/iij/2,y3) 

h{yuyi)y 



and 
2 . 7i : 



{y\,y2) e Aj, for 1 < j < pandps = 
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i=i 



We assume that the mesh is already dense enough such that the condition 

N ( p \ 

u U I is satisfied. 

i=\ \j=i ) 

Now we formulate the following interpolation problem: Let 

[pj^oc ^ ® I 0 < i ^ a € with |a| < 1} 

be given. Then we have to find the function Wi : ZJi — > M with the following 
properties: 

1 . Wi £ 

2. 5°^ Wi{yj) = pj^a for 0 < j < p and for all a G Nq with |a| < 1, 

3. Wi consists of piecewise polynomials of degree at most three. 

We solve this interpolation problem by determining Wi on each triangle A* 
separately. For doing so, we choose one triangle, e.g. Ai, and determine Wi 
on Ai with the help of a Clough-Toucher element. 

By the aim of the center of gravity 

_ y° +y^ +y^ 

3 

we obtain three triangles 

Ai,o = [z,y\y‘^] , Ai,i = [y°,z,y ‘^] , and Ai,2 = [y°,y\z] 
as shown in figure 4.2. 

Now we introduce bary centric coordinates with respect to these triangles: 

1. A = (Ai, A2,Aa) for Ai,o with = (1,0,0), y^ = (0,1,0), z = (0,0, 1); 

2. n = (/xi,/X2,/i3) for Ai,i with y® = (1,0,0), z = (0, 1,0), = (0,0,1); 

3. 1/ = ii/1 .!/•>. 1/3 i for Ai 9 with iP = (1,0,0), = (0, 1,0), z = (0,0, 1). 
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Fig. 4.2. subdivision of A i by the center of gravity 
and determine corresponding Bezier polynomials 



ai +0:2 +0=3 = 3 



aila-ilas! 



. \^2 . \03 



l(iUl,M2,M3) = Y, 






0:i+a2 + Q:3=3 



ai!a2!o:3! 



oi +02+0:3=3 



r 1/^1 . i/«2 . J.0C3 
I I I ^2 ^3 • 

ai a2!a3! 



Figure 4.3 illustrates the calculcation of the coefficients a«, and 
which are given by the following formulae. The rj^a are parameters which are 
to be chosen later. 




Fig. 4.3. Bezier-control-points 
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1- 0003 = boos = ^2, (0,0) 5 ^300 = C300 = ^0,(0,0)j Q )30 = »030 = ^,(0,0)j 



2 . 



4 . 



5 . 



0021 = n,(o,o) + 2 [^,(1,0) • (2/1,2 - 2/1,1) -f n,(o,i) • (2/2,2 

0012 = ^2,(0,0) g [^2, (1,0) • (2/1,1 - 2/2,1) +^2,(0,1) * (2/1,2 

^^102 = ^2,(0,0) + 2 [^2,(1,0) • 2/2,1 + ^2, (0,1) * 2/2,2] , 

6201 = ^o,(o,o) + 2 [^o,(i,o) * 2/2,1 + To, (0,1) • 2/2,2] , 

C210 = ^o,(o,o) + 2 [^o,(i,o) • 2/1,1 -I- ^o,(o,i) * 2/1,2] 5 

C120 = n,(0,0) - 2 [^1,(1)0) • 2/1,1 H- r i,(o,i) • 2/1,2] , 

0021 + ^i,(o,o) + ^120 
0120 = C021 = 2 ’ 

, 0012 + ^2, (0,0) + ^102 

0102 = 0012 = ^ , 

, ^^201 + ^0,(0,0) + C2IO 

0210 — C201 — 2 ’ 



Olll 



_ oq2i + aoi2 + 0102 H- ai2o ^i,(o,o) + y*2,(o,o) 



^111 



Cm 



_ ^102 + 6201 4 - 6210 + &012 _ ^2, (0,0) H- y*o,(o,o) 
2 ” 4 ^ 

C210 d- C120 + Cq 21 + C201 y*0,(0,0) + ri^(o^o) 



O210 = C012 = 



Cm + C021 4 - Olll 



0201 — C021 



Olll + 0102 4 - 6111 



bl 20 — C102 — 



bin H- ^^210 + Cm 



O3OO — ^030 — Co03 — 



2^021 -I" ^120 4- 0210 



-2/1,2)] , 
- 2/2,2)] , 



3 
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In the same way one determines the corresponding polynomials po, p\ and 
P2 for the triangles A2, . . . , Ap. The desired interpolating function is finally 
given by 



'^i{y) { 



{ Po(A) 
Pi(m) 

i P‘iiy) 



for all 
for all 
for all 



y e Aj,o with l<j<p, 
y E A^-,1 with l<j<p, 
y e Aj^2 with 1 < j < p. 



p 

Now let Ti := \J and 

j=i 



Wi{yi,y 2 ,y 3 ) ■=Wi{yi,y2) for all (yi,j/2) e Wi, y^ = %i,j/2). 

With the help of these functions, both, boundary elements of the class 
and finite-element-bases of the class can be generated. 

First we investigate the boundary elements. Here we define 

•= d^h{y^), for 0 < j <p, and a G Mq with |a| < 1 , ( 4 . 1 ) 



and 



<^j ■= {{yuV'^ys) e | {yi,y2) G A and j/3 = Wi{yi,y2)} . 

This set of surface patches aj generates a set of boundary elements of the 

class because of the following properties: 

- For ji 7^ the set is either empty or a common line or a common 

corner point, 

- the crossing of two arbitrary adjacent elements is continuous and their 
tangential planes coincide on the crossing line, 

- the corner points of any element gi are situated on F and the tangential 
planes of F and Gj coincide at the corner point. 



In order to obtain the basis functions for the finite element spaces we solve 
the interpolational problem for all yS E Mq with |/ 3 | < 1 and 

r if J = 0 

^j,oc •“ { a E with |a| < 1 . 

y 0 if 1 < j < P 

With the help of the determined interpolation : ZJi — > M we generate 
the functions 

( WiAv^y-i) for all (j/i,j/2) G Wi and yz = h{yi,y2) 

0 for all (y 1,2/2, 1/3) G r\ri. 

The functions Wi^f3 are defined everywhere on F belonging to the class on 
F ; and they are linearly independent. The linear span of these functions is 
the desired finite element space on F called boundary elements. 
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5 On Hadamard’s regularization of derivatives of 
potentials 

We consider a singular integral of the form 

J D^E{x,x')Wj{x)dtJx (5.1) 

Fi 

where E{x,x*) is the fundamental solution of an elliptic equation, n = 
(Mi 1^2) Its) is a given direction at the point x £ F and 

^ d d d 

P 

for x* € N C r and = |J jj. As a model we consider 






\x - x*| ’ 



where the regularization can be performed explicitly. With the help of the 
coordinate-transformation y = A • {x — x^) from the previous section, one 
obtains 



D, 



1 



\X - Xi 



= D, 



^1 • yi + 1/2 • y2 4- 1/3 • 2/3 



Vvi+yl+yl (j/i + 3/2 + yl)^ 



and 






Let Q := fLen the function h satisfies 

M0,0) = 0, ^(0,0) = 0, and ^(0,0) = 0; 

hence h is of the form 

Hyuy) = e^%i,j/2) 

with a smooth function h. This yields 



and 



D, 



(2/1+2/24-2/3)" = r (1 + Q h { yi , 2/2)) ' 



1 _ / 2/1 . 2/ 2 . /i\ 1 

— — I 2/1“^ -f 2/2 “o + 2 / 3 — I J. 

V ^ ^ sj (1 + 



\x — x^\ 



With the notation 
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we obtain 



/ 

Ti 



D. 



jT ■ Wj{x) 



-- j *^2^ ^3 dyidy2 - j vz^ <Pj dyxdy2, 



which implies the formula 



p.f. 



■I 



yi , 2/2 

1^1 -3 + 1^2 -3 

r J 



V>i dyidy2 



-I 



r'l - 



1 d(fi 
Q dyi 



1 d(p 

Q dy2 J 



dyidy2. 



This formula describes nothing but Hadamard’s regularization of the sin- 
gular integral with the help of weakly singular integrals and hence makes its 
numerical computation possible. For more details on the numerical integra- 
tion of Hadamard finite part integrals see also [4]. 
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Asymptotic Expansions of Elastic Fields 
in Domains with Boundary and Structural 
Singularities 

A.-M. Sandig and W.L. Wendland 



1 Introduction 

It is well known that the solutions of elliptic boundary value problems in do- 
mains with corners, edges and interfaces develop local singularities at these 
geometrical and structural peculiarities. Based on V.A. Kondratiev’s funda- 
mental work [31], G. Fichera [23, 24], V.A. Kondratiev and O.A. Oleinik 
[33, 34], V.G. Maz’ya and B.A. Plamenevski [41, 44], M. Dauge [18], V. 
Kozlov, V.G. Maz’ya and C. Schwab [37], T.v. Peter sdorff [53], H.Schmitz, 
K.Volk and W.L. Wendland [62] and M. Costabel [14] have analyzed the local 
asymptotic expansion for the elliptic system of elasticity. This local expansion 
can be obtained by using Mellin transformation with respect to the distance 
to boundary or structural singularities. The Mellin transformation yields pen- 
cils of differential operators depending on a complex parameter. The resulting 
holomorphic eigenvalue problem provides the basic information on the local 
asymptotic expansion determined by the distribution of the corresponding 
eigenvalues and the associated Jordan chains of generalized eigenfunctions. 

Instead of treating the boundary value problem directly, one can also use 
equivalent pseudodifferential equations on the boundary. One reduces the 
problem to appropriate boundary integral equations for the missing Cauchy 
data. Using a Wiener-Hopf-technique or the Mellin transformation, one can 
obtain a local asymptotic expansion of the unknown Cauchy data about the 
boundary and structural singularities. In particular, this technique is used 
for cracks and interface cracks between different materials in [16, 20, 21] and 
[62] . The singular expansions are needed for criterions of crack propagation 
and its modelling on one hand and, on the other hand, for the compensation 
of the deterioration of numerical stress analysis due to the well-known global 
pollution effect of singularities. The improvement of any numerical stress 
computation can be achieved either by mesh grading based on the a priori 
knowledge of the form of the singularities [2, 3, 6, 55, 57, 54, 39] or by adap- 
tion. The latter is accomplished either by automatic local mesh refinement 
and increase of approximation order based on local error indicators [1] or via 
model adaption by incorporating explicit singular expansion terms into the 
trial spaces as in the augmented Fix method [10, 64]. All these techniques are 
used in finite difference, finite element and boundary element discretizations. 
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This paper is devoted to the analysis of three-dimensional elastic fields 
involving singularities and some related boundary and finite element meth- 
ods. In Section 2 we derive boundary integral equations for three-dimensional 
crack and interface problems. We discuss the direct formulations, where the 
relations between the Cauchy data via the Calderon projection are used, as 
well as indirect methods, where the solutions are reproduced through chosen 
potentials with unknown densities. In Section 3 we present results on the 
asymptotic expansions of the unknown Cauchy data (stress or displacement 
fields) in the firamework of boundary integral equations for domains with 
cracks, interface cracks and polyhedral corners [20, 21, 62]. In Section 4 we 
describe asymptotic expansions of the displacement fields in domains with 
edges or polyhedral corners. We formulate some new results on Neumann 
problems and give modified coefficient formulae in domains with edges [56]. 
In Section 5 we present the relations between the two asymptotic expansions, 
one generated by Mellin techniques for boundary integral equations and the 
other by Mellin techniques applied to the original Dirichlet problem as de- 
rived in [38]. For three-dimensional problems it can be shown that the Jordan 
chains of an associated one-dimensional boundary integral equation on some 
curve 7 on the unit sphere is related to corresponding chains belonging to 
an operator pencil on the sphere, and their jumps across 7. Section 6 is con- 
cerned with mesh grading for h-versions of finite element domain methods 
based on a priori knowledge on the form of the singularities and asymptotic 
expansions about corners and edges [5]. This general method can be extended 
to boundary element methods as it is done in [54] for a mixed boundary value 
problem for the Laplacian in a polyhedral domain. 



2 Boundary integral equations for three-dimensional 
crack and interface problems 

Let be a three-dimensional domain consisting of two domains i?i and 
i?2 occupied by possibly different anisotropic materials where either both 
domains are bounded or i?2 is the unbounded exterior \i?i . The boundaries 
dQi^i =1,2, are divided into two parts; one part is the interface F = df}\ fl 
9i?2 with transmission conditions, the other part Fi consists of the boundary 
pieces where boundary conditions are imposed (see Fig. 2.1 and Fig. 2.2). 
If i?2 is unbounded, we assume that appropriate radiation conditions are 
satisfied at infinity. Cracks between two different materials, the so-called 
interface cracks as well as cracks in an elastic homogeneous body i? of one 
material are included; the lips of the crack are supposed to belong to the 
boundary pieces Fi, 
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n 




Fig. 2.1. A bounded coupled 
structure 



Fig. 2.2. A coupled structure 
with unbounded exterior 



2.1 Crack problems 

Let the domain i? C describe a homogeneous anisotropic elastic body 
containing a crack surface S in the interior. The closure S can be represented 
as a manifold with a (7^-smooth boundary dS {k > 3). The crack-faces 
are denoted by and the corresponding restrictions of a function u by 
— u\s± • The equations of linear elasticity read 

Lu = D^CDu = 0 in i7, (2.1) 

where u is the vector- valued displacement field, C is the 6x6 symmetric, 
positive definite matrix defined by the 21 elastic moduli (see [23]) and 

/ 9i 0 0 0 ^3 02 \ ^ Q 

D^\ 0 d2 0 as 0 , dj = —. 

\ 0 0 53 ^2 9i 0 / 

The Dirichlet and Neumann conditions for the crack read 

= 9 ^ ( 2 . 2 ) 

and 

T{d^,n{x))u\s± :=N'^CDu\s± (2.3) 

respectively, where 
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( n\ 0 0 0 77-3 ri 2 \ 

0 ri2 0 77-3 0 ni j 

0 0 77.3 ri2 ni 0 J 

For the isotropic case, the Dirichlet and Neumann cracks have been investi- 
gated in [16]. In the following we will present the results from [20]. For brevity 
we assume H = R^ \S and the radiation condition 

u{x) = 0 ( 1 ) for |a;| -> 00 . (2.4) 

Theorem 2.1. [20]: Let G H 2 (S) with go ■= {g~^ ~ 9~) ^ ^2 
below). Then u £ H[^^{R?\S) is the solution of the Dirichlet crack problem 
(2.1), (2.2) and (2.4) if and only if 

u{z) = y(p{z) - Vgo{z) for z £ R^\S, 

^ _i 

where ip £ H 2 (S) is the solution to the pseudodifferential equation 

Vip{x)= ^{g'^ + g~) + K*go (x) for x € S. (2.5) 

Here Hf (S) := {u £ Hf {S) : supp u £ 5}, where 5 is a closed compact C^- 

surface with S S and H 2 ^ {S) = (5)j . The condition go £ Hf (S) 

ensures the compatibility between g^ and g~ on dS. 

Furthermore, we have used the notations 

V ^[{z) := f E{z - y)'i[{y) dyS for z £R^ \S, 

XJ(p(z) := J[T{dy,n{y))E{z-y)]^ (p{y)dyS for z G 1^\5, 

K*'ip{x):= f T(dx,n(x))E(z - y)'ip{y) dyS for x £ S, 
s 

Vt/j(x) := f E(x — y)'i[{y) dyS for x G 5 
5 

with a fundamental solution E for the operator -L. 

Theorem 2.2. [20]: Let t^ £ H 2 ^ (S) with to := {t~^ - 1~) £ H 2 ^ (S). 

Then u £ H[^^{W\S) is the variational solution of the Neumann crack prob- 
lem (2.1), (2.3) and (2.4) if and only if 

u{z) — yto{z) — \5xl){z) for z £ M^\5, 

1. 

where 7 /^ G Hf (5) solves the pseudodifferential equation 

Di[{x) = + N*to {x) for X £ S. (2.6) 
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The hypersingular integral operator D is defined by 

D'tpix) = T{dx,n{x)) j [T{dy,n{y))E{x - y)]^ rpiy) dyS. 

s 



2.2 Interface problems 

Let i? = i?i U i ?2 U r be a composite as introduced before (see Fig. 2.1 and 
Fig. 2.2). The interface or boundary transmission problem then reads: 

Find functions Ui^i = 1,2, such that 

LiUi — 0 

BiUi — Qi 

- U‘2 = hi 

T\Ui — T2U2 = /l‘2 



in Oi, 
on Pi, 
on r, 
on r. 



(2.7) 



The operators Li and Ti are defined by (2.1) and (2.2); the boundary opera- 
tors Bi describe Neumann or Dirichlet conditions on Tj. In short, we say that 
(2.7) is a D— D interface problem if on Fi and /I 2 the Dirichlet data gi 
are given; (2.7) is a N-N interface problem if on Fi and F 2 the Neumann 
data gi = U are given; and (2.7) is an N-D interface problem if on A the 
Dirichlet datum and on A the Neumann datum are given. 

The derivation of the corresponding boundary integral equations for these 
problems requires a few particular steps which we sketch in the following: 

(i) [21, 12, 58] 

— Consider extensions Gi of the boundary data gi onto all of F. 

— Solve the decoupled boundary value problems and write the solution 
as a solution operator acting on Gi. 

- Determine the extensions Gi in such a way, that the transmission 
conditions on F are satisfied. 

(ii) [47, 12, 19] 

Assume that the boundary pieces Fi coincide with the crack faces , 
i.e. we consider interface cracks (see Fig. 2.2). 

— Consider extensions Hi of hi onto S = F \J dF U F\ and identify 
Fi=F2. 

— Solve the “pure” transmission problem and write the solution as a 
solution operator operating on Hi. 

- Determine the extensions Hi in such a way, that the boundary con- 
ditions on Fi are satisfied. 

Both solution procedures can be realized with direct methods [67, 65, 16, 58], 
where the relations between the Cauchy data, described by the Calderon 
projector, are used. They were also carried out with indirect methods [47, 
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12, 20, 29], where the solutions are represented by chosen potentials with 
unknown densities. We shall formulate results for both methods: 

Interface Neumann cracks (direct method): 

We present here an extension of the approach in [16, 65] to coupled structures, 
developed in [58]. Let us consider the domain fi = R^ = Ui ?2 U5 (see Fig. 
2.2), where S = dOi = FU OF U A is the common (7^ -boundary consisting 
of the interface (or contact zone) F and the crack faces A — F 2 . As before, 
we introduce the potentials 

Yiip(z) = j Ei{z- y)tp{y) dyS, and 
s 

^iX(z) = J [T{dy,n{y))Ei{z - y)]^ xiv) dyS for 2 G E* \S. 
s 

Ei is the fundamental solution of the operator — A,i = 1,2. The correspond- 
ing boundary integral operators (pseudodifferential operators) are denoted by 



Vi^Pix)^ I Ei{x-y)iP(y)dyS, 
s 

Kl'ij)(x) = J Ti{dx,n{x))Ei{x - y)'i/j{y) dyS for x e S. 
s 

We use the approach (i) and consider some chosen extensions Gi E 
of the Neumann data gi E (A). Every further extension Gi E (5) of 

Qi can then be written as Gi = Gi-\- G^ with G^ E H ~2 (F). Assume that the 
solvability conditions for the decoupled Neumann problem for the bounded 
domain i?i is satisfied, i.e. 



(^Gi + G ^ , := J {Gi -f Gi)cj dS = 0 for j = 1, . . . , 6, (2-8) 

5 

where ej are the traces of the linearly independent rigid motions. 

Based on the concept (i) and the relations between the Cauchy data on dHi 
which are given by the Calderon projections [17], 



/ u, ^ /|/ + (-!)% i-iy^^Vi \fui 

\TiUiJ V (-im \TiUi, 

we define the corresponding Steklov-Poincare operators 



(2.9) 
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The inverse Poincare-Steklov operator which maps the boundary tractions 
into the boundary displacements is well defined for the Neumann problem in 
the exterior domain i ?2 satisfying (2.4) and with the projection properties of 
the Calderon projection one gets 5^^ = V2(— -h For the bounded 

domain i7i we use the pseudoinverse operator acting 

on functions satisfying (2.8) (see [21]). 

If the exterior domain is replaced by a bounded domain i ?2 (see Fig. 2.1), 
then we use the pseudoinverse S 2 ~^\ correspondingly. 

For the Dirichlet, the Neumann and the mixed problem we have the following 
results obtained in [58]: 

Theorem 2.3. (D-D interface problem): Let Gi E H^{dQi),i — 1,2, he 
some extensions of the Dirichlet data gi of the D-D interface problem (2.7), 
which satisfy the compatibility condition 

hi -r\r{Gi-G 2 ) = hi G 5^(r). 

Let Gi—Gi-\- Gj be the desired boundary trace with G^ E Then 

Ui(z) = {-iy+^[ViiSiGi)-Ui{Gi)]iz) forze Gi,i^ 1,2, 
are the uniquely determined solutions Ui G of the D D interface 

o 

problem (2.7), if and only if the functions satisfy the following system of 
boundary pseudodifferential equations on the interface D: 

(I -I \(gA^( hi-r\r{Gi-G2) \ 

\ r\rSi -r|r 52 J \ G., J ^ ^2 - r\r{SiGi - S2G2) j ' 

Theorem 2.4. (N-N interface problem): Let Gi E H~i{df2i) be some ex- 
tensions of the Neumann data gi = U of the N-N interface problem (2.7) 
which satisfy the solvability conditions 

(Gi,ej) =0, for j = i = l,2, (2.10) 

\ / d^i 

where ej are the traces of the rigid motions of the form [axx]-\-b with a,b E 
. Let Gi = Gi + Gf G H~ 2 (dfii) be the desired boundary stresses satisfying 
(2.10). Then 



Ui{z) = i-iy+^[Vi{Gi)-Ui{Sl-^^Gi)]{z), i = l,2 

are solutions ui E of the N-N interface problem (2.7), if and only 

0 

if Gj are solutions of the system of boundary pseudodifferential equations on 
the interface D: 

( \( ] = ( hi - r\r{S[-'^Gi - si~^^G2 \ 

\ r\r -r\r J \ G., J \ h 2 - r\r(Gi - G 2 ) )' 
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Theorem 2.5. (N-D interface problem): Let G\ £ H and G 2 ^ 

H^{df2i) be some extensions of the Neumann datum g\ = t\ and of the 
Dirichlet datum g 2 , respectively, so that (2.10) holds for i = 1. Let G{ — 
Gi~\-G^ E (df2i) be any other extensions of gi where G\ also satisfies 

(2.10). Then 



u,{x) = [ViiGi)-Ur{s[-^\Gi)]{x), 



U2{X) = -[V2{S2G2)-U2{G2)\{x) 



are the solutions Ui E of the N-D interface problem (2.7), if and only 

0 

if G^ are the solutions of the system of boundary pseudodifferential equations 
on the interface F 

( r|r5<-‘> -r|r 'l f 0. \ / Xi 

V r\r -r\rS 2 J \ G^ J \ ^2 / 

Interface Neumann cracks (indirect method) [21]: 

In the indirect approach we seek the solution ui as a single layer potential of 
the Neumann interface crack problem (2.7) in the form 

Ui{z) Vi'ipi^z) for ^ E fli, (2.11) 



where xfi on S are still to be determined. Here one has the following result. 
Theorem 2.6. ; The function u E i7/^^(M^\5) given by 

, ^ r Vi [E-=i + {\I + Klf-^\G, + Gi)l {z), for z £ /2i 

uiz) = < r ^ 0 1 

I V2 [{-\I + K*^)-^{G2 + G^)\{z), forzeOi 
is a solution of the Neumann interface crack problem (2.7), if 
g = ((?i,ci,...,c 6) eH-i(r) Xi« 
is a solution of the system of pseudodifferential equations 

S-^G = f={f,du...,de), ( 2 . 12 ) 

provided the conditions (2.8) together with the compatibility conditions 
hi - - S^^Gi) |r - fti G nlin, 

h2-[di-G2)) \r=h2eH;^{r) 



are satisfied. Here 
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5-1 = 



r\r{s[-^^ - A* 

A 0 



(2.13) 



is a matrix of operators, where A*{{c\, . . . ,ce}) = CjCj, 

A(?i = |(Gi,ej\ I , f = hi-r\rS^^^^h2, di = -(Gi,ej\ 

G <2 — Gi — h‘ 2 . The restriction or trace on F is denoted by r|r- 



and 



3 Asymptotic expansions on the boundary 

Here we present results on asymptotic expansions of the Cauchy data (stress 
or displacement fields) near corners and edges (crack fronts included) in the 
framework of integral equations. Two approaches are applied, the Wiener- 
Hopf method [22, 16, 20, 21] and Mellin techniques [62]. 

3.1 Crack problems 

In [22, Section 9], the Wiener-Hopf method is adapted to the investigation of 
scalar multi-dimensional pseudodifferential equations in the half space . 
This method is generalized to systems of pseudodifferential equations in [22, 
Section 26] and [9]. The Wiener-Hopf technique is used by M. Costabel and 
E. P. Stephan in [16] in order to get the explicit asymptotic expansions for 
the displacement and traction fields at the crack surface in three-dimensional 
homogeneous isotropic bodies. They have derived the explicit factorization of 
3x3 matrix symbols of the corresponding integral equations. These results 
were generalized in [20] to cracks in homogeneous anisotropic materials by 
using an implicit factorization of symbols. In [21] interface cracks between 
two anisotropic materials are investigated. In this case the corresponding 
symbols of (see 2.13) are not explicitly factorizable and do not have the 
generalized transmission property anymore. The explicit form of the leading 
exponents of the asymptotic expansion of the surface traction near the crack- 
fi:*ont is then derived by suitable adaption of the Wiener-Hopf method. These 
results read as follows. 

Theorem 3.1. (Dirichlet crack) [20]: Assume that the right-hand sides of 

the Dirichlet crack problem (2.1), (2.2) and (2.4), belong to H 2 ~^^{S), 
— Then the boundary traction field r is the solution of the bound- 

ary pseudodifferential equation (2.5) and has the following asymptotic expan- 
sion: 

r{s,r) =rj{r)ro{s)r~^ +Ti{s,r), 

where r dist{x,dS) and s denotes the arclength on dS, tq G 
Ti € The function r] is an appropriate cut-off function. 
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Theorem 3.2. (Neumann crack) [20]: Assume that the right-hand sides t^ 

1 j 

of the Neumann crack problem (2.1), (2.3) and (2.4) belong to H 2 {S) with 
-| < cr < |. Then the boundary displacement field xj) is the solution of the 
boundary pseudodifferential equation (2.6) and has the following asymptotic 
expansion: 

'([{s,r) =r?(r)V^o(5)r2 +^i(5,r), 
where 'i/jq ^ H 2 ^"^{dS) and 'ifi € H 2 ~^^{S). 

Theorem 3.3. (Neumann interface crack) [21]: Assume that the right-hand 

“ “f" (T 

sides of the Neumann interface crack problem gi —U ^ Hp (Fi) and hi £ 
Hp^^ with—^ < cr < -|. Then f £ (F) in (2.12) and the 

o 

solution Gi of (2.12) has the asymptotic expansion 



/ ri 0 0 \ / ci(s) \ 

Gi(s,r) =»j(s)K(s) 0 0 02 ( 5 ) +W"(s,r), 

V 0 0 ri-*'' / V C3(s) / 

(3.1) 

where the coefficients in the first term depend on the arclength s and belong 
to Hp^^ (OF) and W{s,r) £ Hp'^'^{F) for a' < a, respectively. S = S{s) — 
^'^ 2 !^^ ez^en?;a/ite of the matrix 0, —1) 5(5, 0, 1), where S denotes 

the symbol of S~^ . The matrix K{s) is given by the principal symbol matrix 
ofS-K 

A calculation of the exponent 6 in the expansion (3.1) has already been 
done by M. L. Williams [69] and T. C. T. Ting [68], based on results of 
the two-dimensional case. In [21] we present a rigorous justification for the 
three-dimensional case. As an example we calculate the eigenvalues Xj {s),j — 
1,2,3 for two isotropic materials with the Lame constants Ai,/xi and A 2 ,/X 2 , 
respectively, with our method (see [21]). With the help of local coordinates 
Kj we have 

5(s,0,±l) = [k;(0,x)T“'5(±1)k;.(0,x)^ 

with 

/ 74 0 0 \ 

5(±1) =2 0 75 ±*76 , 

V 0 The 75 / 

where 

_ \k + 3/Xfc _ Afc + fik _ fJ'k 

2/Xfc(Afc + 2/4;!;) ’ Afc + 3jWfc ’ \k + 2fj,k 

and 
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74 



5]7l.A=(l+72,fc), ,, , 76 

k=l k=l ^ 



71,173,1 71,273,2 

1 - (73,1)^ 1 - (73,2)*'^ 



The eigenvalues of<S ^(-1)<S(+1) are the zeros of 
det[<S(l) - XS(-l)] = 8(1 - A)[7|(1 - A)^ - 7|(1 + A)^] = 0. 

Thus we get 

Ai = 1, A2 = As = and <5 = (52 = ^^-InAs- 

75 + 76 A‘2 27T 

Corollary 1. There are no oscillation terms in (3.1), i.e. each Si vanishes if 
and only if 70 = 0, i.e. if and only if ^ = i- 2 ul > where Ui = 

Poisson ratios for i = 1, 2. 



This result coincides with results by Williams [69] and Ting [68] . 



3.2 Corner problems 

The singular expansions of edge singularities can locally be reduced to two- 
dimensional problems [42, 48, 45] and will be described in Section 4. Here we 
consider the three-dimensional singularities of elastic fields near polyhedral 
vertices where edges meet. This approach is based on the papers [62, 38], 
where Mellin techniques for boundary integral equations are used for the 
reduction of the boundary value problem to a one-dimensional boundary 
integral equation on a closed curve on the sphere. 

Asymptotic expansions near vertices of rotationally symmetric cones under 
symmetric [8] and non-symmetric loads [60, 35] are special cases and are 
calculated by applying the Mellin transformation to the original boundary 
value problems. 

Let i? be a polyhedral domain and its complement. Let us consider the 
Dirichlet problem 



Lu= 0 in i? or in i7^, . . 

u = g on an, 

where L is introduced in (2.1). 

Since we are interested in local representations near a vertex, we associate 
with (3.2) the Dirichlet problem in a cone 

Kj^ — K = [{r,w),r G ,w e 5+} , 

or in K- = \ K, respectively, where S+ = KC\S‘^ is a, bounded domain on 

the unit sphere with a piecewise smooth boundary curve 7 = dKOS^ and 
where S- = We consider the equivalent boundary integral equation 

of the first kind for the stress field t on dK : 
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Vt{x) := / E{x - y)t{y) dyS = /±(x) for x G dK. 

dK 



(3.3) 



The sign + is used for the interior, the sign — for the exterior problem. 

V is considered as a mapping between weighted Sobolev spaces 

V:V-'^dK)-^vhdK), 

\ 

where {dK) is the trace space of the Sobolev space V^{K). The Sobolev 
space Vq {K) is defined via the norm 




By Vo HdK)^[VoHdK)]' we denote the dual space. 

Lemma 3.1. [38]: The operator V is Vq ^ (dK) -elliptic, i. e. there exists a 
positive constant c such that 

(V(p,ip)L 2 {dK) > c\\^\? _i for all (p e V^^idK). 

Moreover, V defines an isomorphism 

V :VP{dK) ^VQ^idK). 

The Mellin transformation 

oo 

h{a) = j r~^~^h{r)dr for a G C. (3.4) 

0 

will be applied to (3.3) after rewriting (3.3) in terms of polar coordinates 
X = (r,u) and y = (p, '*?)• Then the Mellin transformed equation reads: 

Vt{a,uj) = J a{a,u,d)t{a- l,d)d^j — f{a,uj), 

7 

where the Mellin kernel is given by 

oo 

a{a,u,d) := J E{pu — dp. 

0 

Let us denote by V (a) the operator 

V{a)cp{u) = J a{a,uj,d)(p{d)d^j, 

7 

V{a) : H-^j) H^j). 



(3.5) 
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Lemma 3.2. [38]: The meromorphic operator function V{a) is Fredholm of 
index zero. The spectrum of V (a) consists only of isolated points aj which 
are the eigenvalues ofV{a) having finite algebraic multiplicity vj. 



For the formulation of the asymptotic results we need the definition of opera- 
tor pencils A±{a), which are obtained by applying the Mellin transformation 
to the interior and exterior Dirichlet problem in (or K-). Furthermore, 
we assume that K has m edges which meet at the vertex O. 

Theorem 3.4. [38]: Leth E No,yS E and a E satisfy condition (4-^5) 
of section 4- Suppose that the line Rea — H h-{-\ ~ ~Yhi<j<m ^3 ^ 
is free of eigenvalues of A^{a) and A-{a). Let the right-hand side f± in 

(3.3) be given in {dK) D where this space will be defined in 

(3.7). 

Then the solution t (dK) of the boundary integral equation (3.3) admits 
the expansion 



t{x) = T]{r) 

— ^<Re aj <H 



"E E 

i=0 s=0 



s—k 






+ (3-6) 



k=0 



where Ij and ruji are the geometric and partial multiplicities of the eigenval- 
ues aj and where ^ canonical set of Jordan chains corre- 

sponding to aj. The remainder satisfies t E (dK). 



The weighted trace space Vp ^{dK) is defined as follows: The curve 7 = dS^ 
has m corner points qj. For pj = Pj{w) = \w — qj] and A; E No let Vj[{S±) be 
the space of functions with the finite norm 



Mvj{s±) - 













1 

2 



with ^ e E is the space of functions with the norm 



/ 



v\\vsM±) = 






/ 

Vo 



2{/?+^ o-j-A;) k 



j=i 



^ l|(rar)*v(r, dr 



s=0 






(3.7) 



I 



Finally, Vp f^{dK) is for / = A: — | the trace space to Vp^^(K^) (or Vp^(K-). 



A similar asymptotic expansion is derived in [61] in usual Sobolev spaces, 
where the spectral properties of V (a) are used directly. The connection be- 
tween these two operator pencils will be described in Section 5. 




432 A.-M. Sandig, W.L. Wendland 

4 Asymptotic expansions in the domain 

4.1 The edge-problem 

The general form of edge-singularities for solutions of elliptic boundary value 
problems has been described in many papers (see e.g. [18, 15, 32, 45, 48]). 
Moreover, in [11, 43, 46, 45] formulae for the coefficients in the asymptotic 
expansion are derived. However, the papers [11, 43, 46, 45] contain certain 
conditions for the boundary operators which exclude e.g. the Neumann prob- 
lem. Therefore, we investigated variational solutions not only of the Dirichlet, 
but also of the Neumann and mixed problem for the equations of linear elas- 
ticity, based on ideas of the paper [56] . 

Let G be a bounded domain in with piecewise smooth boundary dG. We 
assume that there is a smooth one-dimensional submanifold M C dG, such 
that dG\M is smooth and G is locally diffeomorphic to a dihedron in some 
neighbourhood of any point z £ M. Assume that dG\M is decomposed into 
two smooth parts Fd and /V and define the subspace 

V = {w G Wl{G) ; w = 0 on Td} 

of W^iG). Then we consider the following variational problem: 

Find a vector-function u £V, such that 

r ^ ^ 

b{u,v):= aihki£ki{u)eih{v) dx 

(F,v) = / fvdx-\- j gvda,yv£V, (4.1) 

Jg Jtn 

where FeV',f£ L 2 {G), g G W^^^(rN). 

Here = ^ik(u) = (dx^ui -f- dxiUk)/2 denotes the components of the 

strain tensor, aihki 2 tre the given elasticities, / is the given volume force and 
g is the surface traction. 

If u G W'fiG) then the variational solution u £V oi (4.1) is also a classical 
solution of the boundary value problem 
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with (Tij{u) — aihki^ki{u) and the exterior normal vector n. 

In this paper we consider the following situations: Fd = 9 (Neumann prob- 
lem), Fm = 0 (Dirichlet problem), Fd,Fn ^ i/i (mixed problem). 

We now assume for the sesquilinear form b that a Carding inequality 

Re b{u,u) + 7ll^i|lL(G) > c||m||'^ (4.3) 

with positive c is satisfied for all u G V. In elasticity, (4.3) corresponds to 
Korn’s inequality. 

Operator pencils 

For the description of the behaviour of the solution of the variational problem 
(4.1) in a neighbourhood of a point xq = (0,0, zq) C M we introduce the 
following two operator pencils: 

The operator pencil A( ): 

Let Dxq = Kxq X M be the local dihedron at the edge point xq with the faces 
r^, where K^q = {{r,cp) : 0<r<oo, 0<(^< (^o} is the plane wedge, 
r = |a: - xo|. Then M = E is the edge of We discard the derivative dxs 
in b{u,v) and get the 2D-sesquilinear form 

b{u,v)= ^ aihkieki{u)eih{v)dxidx 2 . 

i,h,k,l=\ 

Introducing the polar coordinates (r,(^) and using the Mellin transforma- 
tion with respect to r {rdjdr -> a) we get finally a parameter-depending 
sesquilinear form 

a(u, V, a) = (A(a)u, , where 

A(a) ; V^V'; 

V = {w € ipo) : - 0}, = 0, = <po, 7'*' = 1 if 

n I'd,, = 0 and7± - 0 if n = 0. 

The operator pencil A( ) : 

We denote by Q — Qxq and the intersection of Dxq and re- 

spectively, with the unit sphere 5^^ = {x : \x — xq\ = g = 1} . Introducing in 
(4.1) the spherical coordinates {g,(jj) = {g,(p,'d) and using the Mellin trans- 
formation {gd/dg a) we get finally a parameter-depending sesquilinear 
form 

a{u,v,a) = {A{a)u,v), 



where 



(4.5) 
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A{a) :V^V',V = {u£ Wl{Q) ; 7±5 = 0 on g±}. (4.6) 

A{a) = {£(a),B^(a)} can be considered as the operator pencil defined by 
the boundary value problem 

C{uj,du},a)u = f in Q 

B^{u,du,a)u = g± on q^, (4.7) 

where L{dx) = q~‘^C(u!, di^,gdg), = I and g± = 0 if 7 =*= = 1, B^ — 
if 7 ± = 0 with t^{dx) = g~^t^{oj,du,gdg). 

Lemma 4.1. [30, 31]: The spectra of A{-) and A{-) consist of isolated eigen- 
values having finite algebraic multiplicities. For an eigenvalue aj of A{-) there 
exist Jordan chains 

{(f)}, k = 1,. ..pj (geometric multiplicity) (4.8) 
5 = 0,... Kj^k — - Riesz index) (4.9) 

or {Uj'^{uj)} for A{], respectively. 



Asymptotic decompositions 

For the local characterization of solutions we need the weighted Sobolev 
spaces 

Wlp{D) = {u£L^{D)-. Y, [ = < oo}, 

where r = r{x) = dist(x, M) and /? G M. 

Theorem 4.1. [56]: Suppose that the sesquilinear form (4-1) provides a 
Gdrding inequality (4-J) 0 ,'i^d that u ^ V is a solution of (4-1) '^i'th the right- 
hand side 

(r}F,v) ^ / frjvdx-\-^ / g±rjvda, 

Jd ^ Jr 

where f € W^^(jD), g± ^ W^^piF^), 0 < /3 < 1 , and where rj = rj{r) is a 
cut-off function. Furthermore, we assume, that the line Re a = 1 - P is 
free of eigenvalues of the operator pencil A(-). Then the solution u has the 
local representation 



XU = X 



co(^)+ Y + 

(r=l 



j,k,s 
0<Re aj <1-/3 



, (4.10) 



where z ^ M, cq ^ W 2 ~^~^{M), Cj^k,s ^ W 2 ^ ^{M) (e is an ar- 

bitrary small positive number), w E W^ 2 ,/ 3 - 2 (^)’ 9w/dxi E 
I = 1,2. X is O' cut-off function with XV — X- The eigenvalues of A{a) are 
denoted by aj; and are the corresponding Jordan chains. 




Asymptotic Expansions of Elastic Fields and Structural Singularities 435 



Coefficient formulae 

In analogy to [43, 46] it is shown in [56] that 

„_i_s(zo) = / fVj’"{zo)dx+ [ gvf'{zo)da, 

JG JTn 

if the solution u of (4.1) vanishes outside Gfl Dxq- Here xq = (0,0, ^o) ^nd 
the are the so-called dual singular functions. These are singular solutions 
of the homogeneous adjoint problem, which have the form 

?=0 

where {Vj'^} are Jordan chains of the adjoint operator A" (a) belonging to 
the eigenvalue —I — aj and € V. Unfortunately the calculation of the 

vector functions is difficult. Therefore we weaken the condition E V 
by assuming 5 = 0 and 

Vj = cos'd) -f Wj with Wj G W^iQ), (4-11) 

where P^{(p,cosd) are polynomials in cost? whose coefficients are infinitely 
differentiable with respect to ip and satisfy the biorthonormality condition 



for 0 < Re aj < 1 - yS, k,m = l,...pj. 

Theorem 4.2. [56]: LetVj be eigenfunctions of A* (•) to the eigenvalue — 1 — 
Oij which have the form (4A1). Then the following formula holds for the 
coefficients in (4’^0): 



Cj,k{zo) + 



Pj no 

E4”/ 

m=i *^0 



Cj,mi^0 "b Q) Cj,m('^0 q) 



dg 



-L 



fVjdx + Y^ [ g±v]da 



where 

= -{A{aj)cosduf,Vj)Q. 



For some boundary value problems of the Laplace operator and the Lame 
system, the weight functions have been calculated with the help of a 
Computer Algebra Program [40]. 
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Remarks: 

The calculation of the eigenvalues aj of the operator pencil A (a) and of the 
corresponding Jordan chains can be found in different papers (e.g. [14, 59, 
28]). Here we recall some results from [50] for a half-space interface crack 
between two isotropic elastic materials. In this case, the displacement fields 
have the following decomposition near the crack front, j = 1,2: 



XUj{x) = X 



Cl {z)r^ 






0 

0 

uljiV’) 

0 



+ C3{z)r^^ 



0 



+ Wj 



(4.13) 



Here 

1 Ij^ //2(1-2 i/i) -/ii(l-2z/2) 

27T 1-/3’ ^ 



(compare [69, 68] ). /ij denote the shear moduli and Vj the Poisson ratios. It 
follows that 






1 — 2i/2 

1-2Z/1 



(compare Corollary 1, where we recall results of [69] and [68] for the singular 
expansion of the boundary stress fields). The decoupling in planar and an- 
tiplanar states is for anisotropic materials not always possible. Nevertheless, 
the displacement field u{x) has a decomposition analogous to (4.13) where 
nonzero components occur, in general. 



4.2 The corner problem 

The behaviour of displacement fields near polyhedral corner points has al- 
ready been discussed in Section 2.2 in the framework of boundary integral 
equations. Especially, in Theorem 3.4 we have referred to spectral properties, 
which now will be explained in more detail. 

For simplicity we consider the boundary value problems in the polyhedral 
cones K±. Assume that dK = dK^ = U Mj^ where Fj are the faces 

and Mj the edges meeting at the vertex O. 

We consider the boundary value problem: 

L{x, Dx)u± = L±{x, Dx)u± = f± in K+ (or A_) (4.14) 

u±\rj = gf for j e iVi C {1, . . . , m}, 

T{dx,n±{x))u± =T±u±\rk for k £ C 

ATi U A ^2 = {1, . • ■ F N ‘2 =0. Clearly, the Dirichlet and Neumann 

problems are included. 
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Introducing the spherical coordinates x = in (4.14) and applying the 
Mellin transformation we get in analogy to (4.5), (4.6) and (4.7) the operator 
pencil 

^±(a) := {£±(a),B±(a)}.^^ 

m 

H\S±) ^ H-\S±) X (4.15) 



where S± = K± fl S‘^ , 7 j = Fj D S‘^ and where ruj is the order of the corre- 
sponding boundary operator. 

For every edge Mj we use local coordinates {yi , 2 / 2 , 2 / 3 ) centered at the corner 
points qj of *S+. The ys-direction is normal to 5^, (^ 1 ,^ 2 ) is tangential to 5^. 
Similar to the operator pencil A(a) in Section 3.1 we introduce the pencil 
A^(a) for every edge. 

Denote by hf the maximal positive number such that the strip \Rea\ < hf 

j j 

does not contain any eigenvalues of Aj{a). 

For /i € No and a € we assume that 



\h-{- 1 - aj\ < min{/it, hj} for j = 1, . . . , m (4.16) 

or, less restrictive 

\h-\- 1 - (7j\ < hf for j = 1, . . . ,m. (4.17) 

Theorem 4.3. [48, 38, 43, 36, 35]: Suppose that f e Vf^^{K±) fl Vf^{K±) 

(see (3.7)) and gj G vf ^ (Fj) fl i^j) are given where h and 

(Tj satisfy condition (4’^6). Furthermore, assume that the line Rea = H = 
/i -f I — /3 — " 2 eigenvalues of A±{-). 

Then the solution u± G Vq{K±) of the problem (4-^4) has the following 
asymptotic representation: 



If mf-l 



u±{x)=r}{r) ^ r°‘i ^ ^ cfj^ufi^{x) + w^(x), 

~^<Reaf<H *=1 »=« 

where 7] is a cut-off function and 



(4.18) 



± ^ gf A (lnr)(^ ^ . 

(s-k)\ 



(4.19) 



Here * is a canonical system of Jordan chains of A±{-) belong- 

ing to the eigenvalue af where and denote the respective geometrical 
and partial multiplicities. 



Note that this theorem also holds if xq is an edge point since the wedge 
through xo can be considered as a vertex to a specific corner. 




438 A.-M. S^dig, W.L. Wendland 



5 Relations between the corner asymptotics in the 
domain and on the boundary 



We discuss the connection between the spectral properties of the operator 
pencil V{a) (see (3.5)) and the operator pencils A±{a) (see (4.15)) belonging 
to the exterior and interior Dirichlet problem (i.e. iV 2 = 0 in (4.14)). In 
analogy to (4.7) and (4.15) we denote the Mellin transformed surface tractions 
by T±{a). 

Theorem 5.1. [38]: The following equivalence holds between the eigenvalue 
problems belonging to V{a) and A±{a), respectively: The number ao is an 
eigenvalue of V (a) and to, - .. Ak i'S a corresponding Jordan chain if and only 
if the following two conditions are valid: 

(i) ao is an eigenvalue of at least one of the operator pencils A±{a). 

(a) There exist chains e^,...,ej of the operator pencils A±{a) corre- 
sponding to ao; and at least one of them is a Jordan chain, i.e. the 
first element does not vanish. Moreover 

tj = T-(ao)eJ - T+{ao)e'f for j = 0,. . . ,k. (5.1) 

Denote by pij for j E A^o the eigenvalues of F(a) lying in the half plane 
R(a) > — I and by i/j the algebraic multiplicity of fij. To each eigenvalue fxj 
we accompany the z/j -dimensional space Afj consisting of the functions 









i=0 



where to,. .. ,t,^ is any Jordan chain of V{fij). 

We construct a generating system of Mj with the help of Theorem 5.1. There 
exist integers > 0 such that fij — a~ or jjLj == a[f and z/j < + 

Vj. Furthermore, (5.1) yields that Nj is generated by the sum of the 
normal stresses of the chains corresponding to the eigenvalues a~ or a'l of 
the operator pencil A±{a): 



(see (4.18)) We enumerate the elements of this system such that 
Tje^T-u-^, (otT+U+J 

In this way we get again a justification of Theorem 3.4 by a different approach. 
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Theorem 5.2. [38]: Suppose that the assumptions of Theorem 3.4 are sat- 

1 

isfied. Then the solution t {dK) of the boundary integral equation (3.3) 
admits the expansion 

t{x)=j]{r) 51 CjkTjk{r,oj) +t{x) 

with t £ Vo^idK) n V^^^idK). 



6 Graded Mesh Refinement 

It is well known that standard numerical techniques lose accuracy as a result 
of the nonsmooth boundary, which causes loss of regularity of the solution 
in comparison to smooth problems. Therefore a variety of adapted numerical 
methods has been developed, e.g. the augmented Fix method [7, 13, 10, 64] 
based on the knowledge of the singular terms in the asymptotics or graded 
mesh refinements near corners or edges based on the local behaviour of the so- 
lution [2, 3, 6 , 55, 57, 54, 39]. In [5] one finds finite element schemes on locally 
graded meshes for a large class of second order boundary value problems in 
polygonal and polyhedral domains. The grading and the corresponding error 
estimates are based on regularity results in weighted Sobolev spaces and on 
the local asymptotics as described in Section 3. 

The graded partitions are defined as follows: 

Let i? be a polygonal domain in or a polyhedral domain in . We con- 
sider a family of finite element partitions Th of Q with the usual regularity 
properties: 

(i) n = [Jn^eTh where J7g are polygons in E^ (triangles or quadrilat- 
erals) or polyhedra in E^ (for instance tetrahedra or bricks). 

(ii) f]ei n i?e 2 — 0 for ei ^ 62 - 

(iii) Any edge (for n = 2,3) or face (for n = 3) of is either a subset of 
9i7 or an edge or the face of another f]e2- 

Denote by he the diameter of i?e and by pe the diameter of the largest inner 
ball of , then we assume that there is a constant a independent of Th with 

(iv) ^ < cr for all e with fie 

Further denotes h = max^^^GTh K the meshwidth parameter of the family. 
In order to treat the singularities of the solution near the irregular part M 
of the boundary, we assume that the partition Th is graded in the following 
way: 

(v) if n M 0 then ^ he ^ ^ 

if i7e n M = 0 then C_ 2 hr\~^ ^ he < C 2 hr\~^ , 
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where re = dist (i7e, M), and fx G (0, 1] is a control parameter of the grading. 
Note that for jx = 1 we have an unrefined partition. 

Such refinements were studied for the Poisson problem in polygonal domain 
in [27, 52, 55] and in three-dimensional domains with edges in [4]. Approx- 
imation results can also be found in [25, 26]. We here give a generalization 
of these results to polyhedral domains and to more general differential oper- 
ators. 

Let u G V be a weak solution of the variational problem (4.1), where i? is a 
polygonal or polyhedral domain 

b{u,v) =< F^v > G V, and for a given F G (fi-1) 

We consider the Galerkin equations 

b(uh,Vh) =< F,Vh > \/vh eShCV. (6.2) 

Here {^/i} is a family of piecewise polynomial functions with respect to the 
mesh parameter h with the following properties: 

(i) Conformity, i. e. Sh C V. 

(ii) Transformability, in the sense that the elements shall belong to an 
affine or isoparametric family. 

(iii) Approximability, in the sense that lim inf \\u — 

Theorem 6.1. [5]: Let f2 be a polygonal domain in or a polyhedral do- 
main in and Th a family of graded partitions. Then the following error 
estimates for the GALERKIN solution Uh of (6.2) holds 

Ho 
Ho 

and e is a small positive number. 






The number Ho is defined 

— for a polygonal domain near a corner point: 

Ho = Re ao — e > 0, 

where ao is that eigenvalue of the operator A{a) (which is defined in the 
plane case analogously to (4.5)) such that the strip 0 < iJea < i?eao is 
free of eigenvalues. 

— for a polyhedral domain near a vertex, where m edges meet: 

Ho = min{/?eao 4- 

where ao is an eigenvalue of the operator ^(a), defined in (4.5), such that 
the strip -i< Rea < Reao is free of eigenvalues, h'j is defined in (4.16). 
(Here we consider only the interior boundary value problem.) 
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- for a polyhedral domain near an edge: 

Ho — Re ao — e, 

where ao is an eigenvalue of the operator A{a), defined in (4.4), such that 
the strip 0 < i?ea < fleao is free of eigenvalues. 

Numerical experiments have been made in [5] for mixed problems for the 
Lame equations in an L-shaped domain and for the Poisson equation in a 
polyhedral domain with a Fichera corner. 

We underline that the general ideas on graded mesh refinements for FEM 
presented here can be also formulated for Galerkin schemes in BEM. We 
refer especially to [54] where boundary element methods on graded meshes 
are studied for the mixed boundary value problem with the Laplacian in 
polyhedral domains. 
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Local, Residual-Based A Posteriori Error 
Estimates Forcing Adaptive Boundary Element 
Methods 

H. Schulz and W. L. Wendland 



1 Introduction 

The aim of this contribution is to prove local residual-based a posteriori 
error estimates for a wide class of nonlocal operators arising in the boundary 
element method corresponding to the Galerkin discretization scheme. First 
results of this type were proved by Saranen [14, 15], Saranen and Wendland 
[16] and Wendland and Yu [25]. These results provide estimates only on fixed 
parts of the boundary surface, which is not appropriate for adaptive methods. 
Here we present a new result to overcome this disadvantage and present for 
the first time local a posteriori error bounds. Furthermore we present a new 
technique to calculate Sobolev norms with non-integer Sobolev indices based 
on a window technique. 

A different approach to adaptivity was used by Carstensen k Stephan 
who localize in [2] the standard a posteriori residual error bounds to drive 
adaptivity and control the global error. Faermann shows in [5, 6] that the local 
Babuska-Rheinboldt error indicator can also be used to drive adaptivity for 
boundary element methods for boundary integral operators of orders 2a > — 1 
and permits in this case global error inclusions. There it is also shown that, 
for collocation methods with continuous splines of odd order and boundary 
integral equations on curves, her approach also works for 2a = —1. 

In the first section of this paper we give a short introduction into the 
problem and the background of boundary value problems. We then intro- 
duce the Galerkin method and revoke some well known global a priori and a 
posteriori error estimates [9] . 

In general the operators arising in boundary element methods are nonlocal 
pseudodifferential operators, which will be considered in the sections 2, 3 and 
4. The crucial point of section 4 will be the continuity property of pseudod- 
ifferential operators as a mapping between Sobolev spaces. This continuity 
is well known [7, 11, 23], but for the local a posteriori estimates in section 
5 we need the corresponding norms in an explicit form. For this purpose we 
revisit this well known result in theorem 4.1 in section 4. The proof will be 
given in the appendix. 

In section 5 we first present in theorem 5.1 a result similar to the result 
in [16] but having the same disadvantage as mentioned above. The main tool 
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for the proof of theorem 5.1 is the commutator property of pseudodifferential 
operators. That means: If A is an operator of order a € M and u G (7^, then 
the operator [u^A] := {uA — Au) is again a pseudodifferential operator, now 
of order 2a — 1. The function uj plays the role of a truncation function for 
localizing a posteriori error estimates. If the measure of the support of uj is 
proportional to the meshsize then the localizing effects are destroyed if the 
meshsize parameter h tends to zero. Therefore we carefully control the norm 
of the pseudodiflFerential operator [u, A] with respect to decreasing support 
of UJ. This is done in subsection 5.2. Using that result, one obtains a localized 
a posteriori error estimate on mesh-dependent parts of the boundary surface, 
as presented in theorem 5.2. They are based on the localized residual. 

In section 6 we present a new method for the computation of Sobolev 
norms with non-integer Sobolev index. The method is based on a certain 
window technique which is related to the ideas presented in [19, 20] and 
leads to the solution of quasi-local problems i.e. that we have to solve local 
Galerkin equations with a globally defined right-hand side. 

In section 7 we present an adaptive feedback algorithm based on that 
window technique and show its efficiency for two simple numerical examples. 



2 PseudodifFerential Operators 

Here we recall definitions and properties of pseudodiflFerential operators fol- 
lowing the presentations in [11] and [23]. For the following well known defi- 
nitions we consider a bounded domain X C ffi” . 

Definition 2 . 1 . Let 0 < p, 61,62 G M fee given numbers. p{x,y,^) is called a 
symbol of order m G E m the class of symbols x X x E^), iff for 

every multi-index (ai,a 2 ,a 3 ) there exists a constant Cai,a 2 ,az such that the 
estimate 

\D^^D^^Dppix,y,0\ < Mi l«i (2.1) 

holds for all x,y G X. The corresponding pseudodifferential operator B of 
order m is defined by the oscillatory integral 

{Bu)(x):^^^ j j e'^''~^'>^h{x,y,0'u{y)dyd4. ( 2 . 2 ) 

RnRn 

For the following we consider classical pseudodifferential operators, i.e. we 
assume p = I and 6\ = 62 = 0, that means 

+ 1^1)"“'“^'- (2-3) 

We denote the corresponding symbol and operator classes by x X x E’^) 

and 0p5"*^, respectively. Furthermore we set := US'^. 

Sometimes an additional regularization is necessary: 
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Definition 2.2. We define the space 0/ pseudodifferential operators 
as the space of operators B of the form 

B = Bq R (^’^) 

with Bq E OpS'^ and R E OpS^ . 

Next we define pseudodifferential operators on the boundary surface F. For 
simplicity we consider a manifold F E . 

Definition 2.3. Let F € be a compact manifold and {{Ui,^i),i E /} 
(with a finite index set I) a corresponding atlas (see e.g, [26] for definitions). 
Let 

B : C^{F) -^C^(F) 

be a linear continuous operator. Then the operator B is called a pseudod- 
ifferential operator of order m on F, iff for every local chart {Ui,^i), the 
operator 

Biu ;= (5(u o 4>i)) o (2.5) 

is a pseudodifferential operator of order m as in definition 2.1, over ^i{UinF) . 
For simplicity we shall identify the manifold F with a domain X C . 

For solving equations with pseudodifferential operators by using Galerkin 
methods with boundary elements it turned out, that the property of strong 
ellipticity [22] is essential (see [24]). 

Definition 2.4. A pseudodifferential operator A is ca//ed strongly elliptic iff 
there exist a positive constant 70 and a complex function O £ C^{F) such 
that for all x £ F and all rj £ MP with \rj\ = 1, the principal symbol ao{x,r]) 
of A satisfies 

Re G{x)ao{x,r)) > 70 . (2.6) 

This definition is a simplification of strong ellipticity of a p x p matrix of 
pseudodifferential operators, given e.g. in [17]. 



3 PseudodifFerential Equations 

Let us consider a domain Q in N = 2,3, with (at least) Lipschitz contin- 
uous boundary surface F = dfl of dimension n := N - 1; and the equation 

Au = f (3.1) 

with a given pseudodifferential operator A of order 2a and a given right-hand 
side / £ H~^{F). For the unknown solution u we define an approximation 
by the Galerkin-Bubnov method: Find Uh G C H^{F) such that 

(iph,Auh) = {(fhj) 



(3.2) 
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holds for all iph ^ The finite element space on F is defined in 

the sense of [1] with piecewise polynomial splines with polynomial degree d 
and smoothness r (here r = -1 corresponds to the discontinuous splines) on 
a mesh family with maximal meshsize parameter h. 

Example 3.1. We consider the Dirichlet hoxmddxy value problem: 

P{x, D)U = /o for X G /?, 

Tot/ = ^0 for X G r 

where P(x, D) is a given strongly elliptic linear differential operator of second 
order and /o and go are given functions defined on fi and P, respectively. 
7o is the trace operator. Then it is well known [6, 9] that the solution U 
can be represented with the given trace go on the boundary surface and the 
conormal derivative d^U by the sum of a single layer and a double layer 
boundary potential [8] or by a Taylor expansion [19, 20]. One possibility 
to compute the unknown Cauchy data d^U consists of solving a boundary 
integral equation of the form of equation (3.1) where the single layer potential 
operator plays the role of A (2a = — 1) and / is given by the boundary values 
of the double layer potential applied to the given trace function and a 
Newton potential operator applied to /q. 

Example 3.2. We consider the Neumann boundary value problem: 

P{x,D)U = fi for X G i7, 
dpU = g\ for X G P 

Now the unknown trace 70 P = 5^0 can be computed by an integral equation of 
the second kind where the sum of the identity and the double layer potential 
operator plays the role of A (2a = 0) in (3.1). / is given by the single layer 
potential operator applied to g\ and a Newton potential operator applied to 
/i- 

For mixed boundary value problems see e.g. [21]. 

If (3.1) is solved via Galerkin’s method then, for the Galerkin solution Uh 
there holds an asymptotic optimal global a priori error estimate [9]: 

Proposition 3.1. Let A be a bijective strongly elliptic pseudodifferential op- 
erator of order 2a. Then we have the optimal asymptotic a priori error esti- 
mate: 

- '^hWH^(r) £ W'^Wn^r) (3.3) 

with 2a — (d+1) <s<a<t<d-\-l and a < r + 1/2 (N = 2) or a < r 
(N = 3). h is the maximal stepsize. The constant c is independent of the 
exact solution u and the meshsize h. 

Remark 3.1. In [4] a modification of proposition 3.1 for Galerkin-Petrov 
schemes is proved. The extension to systems is done in [24]. 
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For simplicity we atssume that the operator A~^ exists. Then A~^ is again a 
pseudodifferential operator, but of order —2a. From the continuity of pseu- 
dodifferential operators (see [7, 11, 23] and section 4) there follows the global 
a posteriori error estimate 

Qlk/illif— 2«(r) < lk/i||rf*(r) ^ CrWrhWn^^-^^ir) (3.4) 

where Ch •= u — Uh is the error of the Galerkin discretization and r^ 
f — Auh is the residual. The residual is computable, so estimate (3.4) gives 
a computable asymptotic bound for the error. The constants q and Cr are 
the continuity constants of A and A~^ and in general not known explicitly. 
Desirable is now 

1. A localized version of (3.4). First results of this type were proved in 
[14, 15, 16] and [25], but on fixed parts of the boundary surface which do 
not change if h 0. 

2. Methods to compute //^-norms (with a 0 N e.g. a = 1/2 or a -1/2) 
for the localized residual. One possibility to compute i/^-norms with 
a ^ N of the residual is the use of the explicit Sobolev-Slobodecki norm 
as in [5, 6] (see e.g. [26] for the definition). With an uniform mesh and 
B-splines, the norms can be expressed via circulant matrices which are 
used in [12, 13]. A modification leading to sparse matricies was devel- 
oped in [10]. For all these computations, the residual is needed at some 
appropriate points x £ F. By the definition of the residual we have 

No 

rh(x) = fix) - Auhix) = fix) - ^ UiAipiix) 

1=1 

No . 

= / ^iy^y-^)¥>iiy)dsy, 

.UPPV^, 

where Nq is the dimension of and Kix,z) denotes the Schwartz 

kernel of A. For each point x we have to compute Nq integrals numerically 
(including singular or hypersingular and some nearly singular integrals). 
This means, that for the approximation of the residual in Nq points we 
need at least the computation of Nq integrals. 

Another possibility for computing H^{F)-noTms of the localized residual 
is the window technique. There we will obtain that the computation of 
the norm of the localized residual is equivalent to the solution of a certain 
local problem. 




450 H. Schulz, W. L. Wendland 

4 Continuity of PseudodifFerential Operators 



Theorem 4.1. Letp{x,y,^) £ S’^{F x F x M") be a symbol which satisfies 
the decay conditions (2.3). Then the corresponding pseudodifferential operator 
B is a continuous mapping from H^{F) to H^~’^{F) for any s € M and there 
holds the estimate 



< c{s,m)C(^^kofi • ^\s\-2toi^) • ^\s-m\-2koi^) (4.1) 

with io > + |s|), N 3 /cq > |(n + |s - m|) and (see [11]) 

Mt{u) := (27r)-"/2 1(1 + |^|)‘ \u{0\d^- (4.2) 

5 is the Dirac measure at the origin. The constant C^o,^o;0 computed 

from the constants C'ai,a 2 ,a 3 of the decay conditions (2.3) by 



(?«o.*o;a3 (^°)(^) S 12 <^71.72, (4-3) 

t=0j=0 7 / \ J / |T.|=2i|72|=2j 

The constant c(5,m) is independent of the symbol p{x,y,^). 

The proof of theorem 4.1 is given in the appendix. 



Consequence: To control the norm of a pseudodifferential operator B : 
H^(r) it is necessary to control the derivatives of the symbol 

with respect to x up to an order f 1^1) ^th respect to y up to 
an order ko > ^(n-|-|s — m|). This knowledge will be decisive in section 5.2. 



5 Localized A Posteriori Error Estimates 

5.1 Localized A Posteriori Error Estimates on Fixed Parts of the 
Boundary Surface 

In this subsection we show a simple error estimate similar to the estimates 
in [14, 15, 16, 25] by using the commutator property [7, 23] 

||[o;, A]u|U- 2 «+i(^) < C{uj,A) ||t;||H-(r) for all t; E H^{r) {C) 

and for any function a; E C^{r). The operator [cj. A] denotes the commuta- 
tor 

[UjA]v := (ujA — A(jj)v = uj(x) • (Av){x) — (A(a;v))(x). 

The fundamental relation for the development of a localized a posteriori error 
estimate is the error equation: 
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Ach = Vh (5.1) 

and the reformulation of (5.1) into 

Auch -f [cj, A]eh = ujvh (5.2) 

for arbitrary uj. Based on this formulation and the commutator property we 
obtain the following theorem: 

Theorem 5.1. Let A he a strongly elliptic pseudodifferential operator of or- 
der 2a with bounded inverse A~^ . Let Fi be an element of the decomposition 
of the boundary surface in to finite elements and let Fi be a vicinity of Fi, 
i.e. Fi ^ Fi C F. We consider truncation functions oji € C^{F) with the 
following properties: 

1. Ui{x) = 1 for all X G Fi, 

2. suppui C Fi, 

3. Ui{x) > 0 for all x e F. 

Then there holds the following estimate for the localized error: 

^ W^i^hWn^ir) < CrWcoirhWH^^-^^ir) d- 

(5.3) 

with the perturbation terms 

ef'’'\u) = ci,rh^-^+^\\u\\H<(r) 

and 2a — (d 1) < s — 1 < a < t < d 1, a < r 1/2 (N — 2) or a < r 
(N = 3). h is the maximal meshsize. 

Proof For the estimate of the localized error from above we apply the oper- 
ator A~^ to equation (5.2) and obtain 

WiCh = A~^Wirh - A~^ [Wi, A]eh. (5.4) 

Prom (5.4) there follows with the commutator property 

\\^i^h\\H-(r) < \\A~^ujirh,\\H‘>(r) \\A~^[ui,A]eh\\H-{r) 

< Cr • {\\(^irh\\H‘>-^^(r) d- ||e/i||/f^-i(7-)) 

with Cr = \\A~^\\Hs- 2 ccfjs. Proposition 3.1 with 2a-(d-(-l)<5-l<a< 
t < d + 1 leads to 



which concludes the proof for the estimate from above. We obtain for Cr 

Cr = Cg ’ \\A~^\\hs-2oc hs • \\[(jJi, A]\\hs-i • (5.5) 

For the estimate of the localized error from below we obtain in a similar way 
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< ||Awie/i||H— 2«(r) + ||[w<,v4]e/i||^,-2«(/’) 

< imiff'.H— 2“||wieft||/f.(r) + 2«||eh||H.-i(r). 

This leads for q to the expression q = \l\\A\\jj > The norm ||eh||H>-i(r) 
we estimate again by using proposition 3.1. Then we obtain 

ci = cg • TT-jTi ||[a;i,A]||jj.-i ff— 20,. (5.6) 

1 1^1 Iff”, if— 2“ 

Remark 5.1. The behaviour of the commutator norm ^.-20 is 

essential for the behaviour of the perturbation terms in theorem 5.1. Using 
the “inverse” error equation 



A ^Th = Cft 

one obtains another commutator and different Sobolev indices. Based on the 
equation 

A~^iorh + [w, A~^]rh - we/i (5.7) 

one obtains essentially the same inequality as in (5.3). Using this approach 
for the behaviour of the perturbation terms, the behaviour of the commutator 
norm ||[u;i, A-1 ]||h. ^ 2 <x-i jia is decisive. In the context of the generalization 
of the local error estimate (5.3) in section 5.2 to /i-dependent parts of the 
boundary surface this fact will be particulary important. 

Remark 5.2. In theorem 5.1, for the estimate of the localized error in the 
//^(r)-norm, we have used the fact that the error can optimally be estimated 
in the //*“^(jT)-norm in order to obtain perturbation terms which are of one 
order less than the error itself. We have not used the optimal error estimate 
up to the order 2a — (d -f 1). There are two ways to obtain error estimates 
with much smaller perturbation terms: 

- Recursive application of the commutator property. 

- Using the product property: The operator uiBu )2 is a pseudodifferential 
operator of order — (X), provided suppcji fl suppa;2 = 0- 

Both ways have the disadvantage that the constants ci and Cr depend on the 
truncation function u). 

5.2 Generalization of the Commutator Property 

Definition 5.1. Let Fi be an element of the decomposition of the boundary 
surface in to finite elements and let £i> Q be a given number. We define an 
associated vicinity of Fi by: 

:= {x € r : 3y ^ Fi with dist(x,y) < Si} C T (5.8) 

We call a function Ui{x) an admissible truncation function, iff 
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1. suppui{x) c 

2. Ui{x) = I on Fi, 0 < Ui{x) < 1 for all x £ F, 

3. \D^(jJi{x)\ < for all x £ F. 

For brevity we write in the following e instead of Si. 

Lemma 5.1. Let B be a pseudodifferential operator of order m = 2/3 with 
symbol p{x,y,() satisfying the decay condition (2.3). Furthermore let ui be 
an admissible truncation function. Then, for fixed e, the commutator 

(Biu){x) := [uji,B]u{x) = Ui{x){Bu){x) - {Buiu){x) 

is a pseudodifferential operator of order 2/3 — 1. Furthermore, for the symbol 
of {Biu){x) one obtains the decay condition 

01 •(! + (5.9) 

where the constant independent of the behavior of the truncation 

function Ui. 



Proof. Prom the definition (2.2) of the pseudodifferential operator B there 
follows 

(Biv){x) - {2n)~" j J -Wi{y)]p{x,y,^)v{y)dyd4. (5.10) 

R" X 

The difference Ui{x) — uJi{y) can be written in the following manner: 

n 

LUi{x) - Ui{y) = "^{SjUi){x,y) with (5.11) 

3=1 

(^SjUJi)(^x, y ) . j yj—i j x!j , Xj^i , ..., Xfi) 7 •••■) yj—i j 2/3? ^3+1 ’ ^n)- 

Using (5.11) and the relation = i{xj — ^ one obtains 

from (5.10) 

(Bjv)(a;) = (27r)“"(-i)^ J j {x,y)p{x,y,^)v{y)dyd^. 



where 



hj\x,y) 



{SjU>i){x, y) 
~ Vj 



Partial integration with respect to then yields 

{Biv){x) = (27r)~”i J j e'^'‘~^^^^hf\x,y)D^.p{x,y,Oviy)dyd^. 

3=lRn X 
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This leads for the symbol of Bi{x) to the representation 

n 

Pi{x,y,0 (5.12) 

J =1 

Now we have to justify an estimate of the form 

(1 + |^|)2/3-i-I“3|. (5.13) 

The Leibniz rule gives 

n 

DTD^-D‘^^Pi{x,y,0 = i [hf {x,y)Dl^ D^.p{x,y,0] 

i=i 






(5.14) 



Elementary computations show that 

\Dl^D^’^h^p{x,y)\ < j = (5.15) 



holds for arbitrary multi-indices 71 and 72 . Using (5.15) in (5.14) we obtain 
for the decay ofpi(x, t /,0 estimate 



A — |q:i — /?i+Q: 2 — /? 2 | — 1 . 

a; 



\D^'D^"‘D^^Pi{x,y,0 <Co- Y1 (^ 2 )( 

02<^2 

■'^\D^,W^-D^^D^.p{x,y,0\. 

j=l 

Using the decay condition (2.3) of p{x,y,^) we obtain 

D^'D^-D^^Pi(x,y,0\ ^ • E (^') • 

/^ 2<«2 

■CpuP.,a^ (l + l^l)'^"’-'-'"^', 



where 

n 

^/?l,/? 2 ,a 3 ^ V ^/?l)/? 2 ,Qt 3 + lf,i 

i=i 

and C'/?i,/? 2 ,a 3 +i(j) ^re the constants from inequality (2.3). Here l(j) means 
the multi-index with 1 in the j-th position and otherwise 0. So we obtain for 
the decay of Pi(x,y,^) 
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where 



^ai,a2,a3 •“ 



(5.16) 






For e < 1 and 0 < Pi < ai, i = 1, 2, there holds 

|ai— /? i 4 -Q! 2 — /? 2 | — 1 g.— |o!i+a 2 | — 1 

Using (5.17) we obtain from (5.16) the desired estimate (5.9). 



(5.17) 



Using lemma 5.1 we are able to estimate the norms of the commutators [cj^, A] 
and [uji^A~^] (to control the perturbation terms in theorem 5.1) with respect 
to the decay of the truncation functions in dependence on e. 

Lemma 5.2. For the norm of the commutator [a;*, A] there holds the esti- 
mate 

(5.18) 

with £ > n-\-\s — l\ and k > n-\-\s — 2a\. For the norm of [ui,A~^] we obtain 

|l[a;i,A ^]||^^-2«-i^^^ (5.19) 

with £>n-f|s — 1— 2a\ and k > n -f- |5|. The constants C and C are 
independent ofe. 



Proof. We only show (5.18). Then estimate (5.19) follows in the same manner, 
theorem 4.1 gives, with |s - 1| - 2£q < -n and \s - 2a\ - 2ko < -n, 

< cQo,Ajo;0 • M|^_i|_2^o((5) • M\s^2a\-2koi^) 

with C'^o,/ko;0 defined in (4.3), but using there (? 7 i, 72 ,a 3 (from (5.13)) instead 
of C' 7 i, 72 ,a 3 (from (2.3)). From (5.16) and (5.17) there follows 

^ < C • p— I^ i+Q!2|4-1 (k or)\ 

^ai,a2,oc3 '^ai,a2,a3 ^ VO.ZUJ 

and, hence, it is possible to estimate (7 ^o,a;o; 0 (defined in (4.3)) in the following 
way: 



io ko 

Cio,ko;0 = ^tl 

i=0 j=0 



io 



^71,72,0 

l7i|=2i |72|-2j 



io ko 

SEE 

i=0 j=0 






E E 

|7i|=2i |72|=2j 



C ..£-171+721-1 

71,72,0 ^ 



< £ 



-2io—2ko- 



^0 ^0 / 0 \ / 1, \ 

1 / ^0 W \ Y^ 

’ 2L, 2^ )\ j ) 1^ Wi,72,o^ 

i=0 i=0 ' l7il=2i 1721=2? 



where the constant is independent of e. Setting l:=2to > n + |s - 

1 — 2a| and k \= 2ko > n + |s — 2a| concludes the proof. 
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5.3 Localized A Posteriori Error Estimates on Mesh-Dependent 
Parts of the Boundary Surface 



Theorem 5.2. Let the assumptions of theorem 5.1 be fulfilled. Furthermore 
let 0 < 6 < 1 be a given number and cj* an admissible truncation function 
with respect to a boundary element Fi . In addition we assume that the mesh is 
0-regular, i.e. there exist constants 0 > I and cr> 0 such that hmin > . 

The decay parameter €i of the truncation function ui we specify by the relation 



Si >coh] with hi := ir^l 



(5.21) 



and 



7 < 



l-<5 



i and k have to satisfy either 



(5.22) 



a)i > n + |s — 1| and k > n+\s — 2a\ or 
h)i > n 4- js — 1 — 2a| and A: > n + |sj 

and we choose that case where t-\-k is minimal. Then for the localized error 
there holds 






M/ 



with perturbation terms 



(5.23) 






and 2a — (d-fl) <5 — l<a<t<d+l;a<r-f 1/2 (N = 2) or a < r 
(N = 3). 



Proof. We assume that a) is satisfied. From the proof of theorem 5.1 (see 
(5.5) and (5.6)) we conclude for the perturbation terms 



Prom assumption (5.21) with (5.22) one obtains 

• h < • h < ■ h < chf~^ ■ h = ch^. 

Using lemma 5.2 on the norm of the commutator [cj*, A] we find 

< t • • h • h^~^\\u\\Ht{r) < c" • h^ • h^'~^\\u\\Ht{r)‘ 

The case of condition b) follows in the same manner. 

Remark 5.3. Assume s = a. Then the conditions a) and b) for the numbers 
t and k from theorem 5.2 are 

a)^ > n -f |a — 1| and A; > n H- |a| or 
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h)i > n + |a -f 1| and k > n + \a\. 

To minimize i we have to distinguish between the case that A is a pseudod- 
ifferential operator of positive order and the case of negative order. In the 
case of positive order it is obvious that condition a) requires a smaller £ (that 
means a smaller support of cui) than condition b); and vice versa in the case 
of an operator of negative order. 



6 Window Techniques for the Computation of 
Norms of the Localized Residual 

Let (jJi be an admissible truncation function. We define the following family 
of problems: Find functions E H^{r) such that 

= ujif - [Ui, A]uh ( 6 . 1 ) 

holds for i = 1, ...,7Vi, where N\ is the number of boundary elements. Fur- 
thermore we define 

jyW ~ g(») - uJiUh. (6.2) 

Lemma 6.1. Let the assumptions of theorem 5.2 and A = A* be fulfilled. 
Then there holds 

\\uiU - g^^"^\\H-(r) < \\u - UhWn^-^r) < ch^^^~^^\\u\\H^r) (6.3) 

with 2a — (d-\- 1) <s — l<a<t<d-\-l and 

He = \\(^irh\\E-- (6.4) 

Here || • Hf energy norm with respect to the operator A; 

II • He:- is the energy norm with respect to A~^ . 

Proof. For the proof of (6.3) we first consider Cctse a) (see the assumptions 
of theorem 5.2). From (6.1) and the equation 

AuiU = (jJif - [u)i,A]u 



there follows 

A{ujiU - = ~[uJi,A](u - Uh). (6.5) 

Using lemma 5.2 we obtain 

\\u)iU - g^'^^\\H-{r) < IIN?^]||h— i,ff*-2«ll^ - '^^/iIIh— i(r) 

< - UhWn^-^r)- 

Applying proposition 3.1 one gets (6.3). In the case of condition b) we use 
the relation 



[ui,A]^-A[uji,A-^]A, 
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then estimate the norm of the commutator [ui,A] by 

and apply lemma 5.2 for 

Now we prove equation (6.4). From the definition (6.1) of the functions 
5r(*) there follows 

= (jj - - Aui)uh = (jOiVh + AuiUh, 

therefore we have 

^(g(») _ (6.6) 

With the help of (6.6) it is easy to obtain (6.4), since 

- (gW - cjiUh,A{g^^'> -ujiUh)) 

= ~(^iUh,(^irh) = {A{g^^^ -u}iUh),A~^u}irh) 

= {ojirh,A~^OJirh) = ||cJirft|||._. 

Remark 6.1. The identity (6.4) is independent of special properties of pseu- 
dodifferential operators (e.g. the commutator property). 

Consequence: By theorem 5.2, the last lemma and the norm equivalences 

II • He ~ II • ||ff“(r) and H-He IMIff-“(r), 

we obtain an estimate of the error from below and from above 

di||»?^‘^llE -4*^(w) < ||w<eft||ff«(r) < dr||4*^llE + £ft^(w) 
with the perturbation terms 

Now let us define local approximations of the functions and . Let 

^ c H“(ri) 

be a local space of ansatz functions of degree 4*^ smoothness r^\ local 
meshsize and support in We define local functions gj^^ € by 

(¥>, Agi‘^} = {(f, u}if) - {(f, i[wi,A]uh) Vv? G 5f ^ (6.7) 



and 



(0 (i) 



( 6 . 8 ) 
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Lemma 6.2. Let us assume h ‘2 < c\och^ with g > 1. Then we have for the 
differences between the global and the local functions the estimate 





Il9^*^ - 9hl\\E < c -f 4 j ||w||H*(r) 


(6.9) 


and 


Ih^llE - |h2llE < C ||u||„,(r), 


(6.10) 


provided a < t < min{d, + 1, t G N, and with S, 7 defined in 

5.2. 


theorem 


Proof. Equation (6.1) is equivalent to 






(y>,^ 9 W) = {^p,u)^f) - {ip, ([wj, AJuft) \/ip G H°‘{r). 


(6.11) 


Prom equation (6.11) with G 5,-^^ and (6.7) one obtains 






0 - (vP,^(ff«-52)) Vy.G5f\ 


(6.12) 



Using this orthogonality and the definition of the norm || • He: we find 



- 9-2 III = (9^*^ - - 9hb) 

= (9^*^ - ¥>, ^(9^*^ - 92)) ^ • 

Applying the Cauchy-Schwarz inequality and dividing by - 9hl He. this 
gives 

I|9«-92|Ie < I|9(*>-9>I|e Vy.G5f). (6.13) 

With the triangle inequality and taking the minimum with respect to (/? € 
one obtains 

Il9^*^ -92 ||e < Il9^*' - Wiu||f;+ min llwiM-y-llE. (6.14) 

¥>65f> 

With (6.3) and the norm equivalence between II-He and ||■||i^«(/’) we estimate 
the first term of the right-hand side of (6.14) by 

II 5 W -u>iu\\E < c||sr^*^ - < ch*““+'*||u||H«(r) with a < f < d-|- 1 . 

For the second term in (6.14) there holds with some (p* G 
min ||wju - ip\\E = \\oJiU - ¥.*||e < chl~°‘ ||a;<u||Ht(n for a < t < 4 *^ -|- 1 . 

. . ^ c*(2) t / 



From definition 5.1 we obtain 
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The estimate 

< d%2 ^ ^ 

concludes the proof for inequality (6.9). 

Inequality (6.10) follows directly from (6.9) and the definition of and : 






(»)i 



< c (/i‘- 









provided a <t < min{d, (4*^} 4- 1. 



Finally we collect our result in the following theorem: 

Theorem 6.1. Under the assumptions of theorem 5.2, lemma 6.2 and d < d 2 
there holds for the localized error the estimate 

diWVhlWE - sf\u) < \\uieh\\H-{r) < drlhSlU + 4'’^(w) 

with the perturbation terms 

= di,r (/!*-*+' ||«|k.(n- 

The proof follows immediately from theorem 5.2 and lemma 6.2. 

Remark 6.2. For a global error estimation, the result of theorem 6.1 is not 
useful. There are other possibilities, for instance an approximation of the error 
based on the error equation and elements with higher polynomial degree [18]. 



7 Adaptive Methods, Numerical Examples 

7.1 The Adaptive Algorithm 

Let TOL > 0 and 6 £ [0, 1] are given numbers. For the following two numer- 
ical examples we have realized the algorithm: 

1. Compute a Galerkin solution. 

2. Compute the local error estimators := for every element Fi, 

compute rj as the sum of the local values r^. 

3. If < TOL is fulfilled, stop. Otherwise refine all elements Fi with 



Ti ^ 0 ' Tniax 



and proceed with step 1. 

In the following examples we have chosen 9 = 0.01. 
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7.2 Two Numerical Examples 

Example 7.1. We consider the boundary integral equation 

- ^ Jln\x- y\u{y) dsy = /(x), x e T (7.1) 

r 

on the boundary T of a circle i? with radius 1/2. The right-hand side is 
chosen in such a way that the solution of (7.1) is given by 

~ ^ (^) ~ k ~ ^ei I d" C2 In \x — I -I- C3X1 + C 4 ,Xi ■ 

on 

Xei and Xg 2 s.re points outside of the circle, the parameters s\ and 62 represent 
the distance to the boundary. Here we have chosen Xe^ = (0,0.501)^, Xe^ — 
(-0.501,0)^ (si = £2 = 0.001) and ci = C 2 = 1, C 3 = C 4 = 20. To solve 
(7.1) we consider a Galerkin method with piecewise constant trial- and test- 
functions. 

Example 7.2. We consider the same equation and the same method, but on 
the boundary of an L-shape like domain: 

n := {x : X = x{r^(p) = {r cos (p,r sin (p)~^ , r e [0, ^), (p € (0,(^o)} 

with (^0 < 27t. The right-hand side is chosen in such a way that the solution 
of (7.1), given by 

u{x) = with U{x) = U{r,(p) = sin( 7 r— ), 
on (po 

has a singularity at the origin. We choose ipo = 7/4 • tt. 
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Figures 7.1 and 7.2 display the errors with respect to the number of degrees 
of freedom of the adaptive method in comparison with uniform meshes (Ex- 
ample 7.1): 





Figure 7.2: -error 



Remarks: In Example 7.1, the solution is nearly singular, but smooth. For 
that reason the error of the uniform grid refinements converges optimally, 
but there is a long pre-asymptotic region. The adaptive method gives optimal 
convergence with a significantly smaller pre-asymptotic region. 
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Figures 7.3 and 7.4 show the errors with respect to the computing times (sec.) 
of the adaptive method in comparison with uniform meshes (Example 7.1): 





Figure 7.4: -error 



Remarks: The behaviour with respect to the time is in principal the same 
as the behaviour with respect to the number of degrees of freedom. Only in 
the first adaptive steps one observes that the adaptive method is a little more 
time consuming. 
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Figures 7.5 and 7.6 display the errors with respect to the number of degrees 
of freedom of the adaptive method in comparison with uniform meshes (Ex- 
ample 7.2): 





Figure 7.6: -error 



Remarks: In Example 7.2 the solution is nonsmooth. Here the uniform 
meshes cannot produce the optimal order of convergence. For the error in 
the energy norm the adaptive method detects the optimal convergence or- 
der. Also the error in the L^-norm convergences much faster than on the 
uniformly refined grids. 
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Figures 7.7 and 7.8 show the errors with respect to the computing times (sec.) 
of the adaptive method in comparison with uniform meshes (Example 7.2): 




Figure 7.7: Energy error 




Figure 7.8: -error 



Remarks: The behaviour with respect to the time is in principal the same 
as the behaviour with respect to the number of degrees of freedom. Only in 
the first adaptive steps one observes that the adaptive method is more costly. 
A numerical comparison between 11^ localized error in the 

energy norm is given in [18]. 
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A Proof of theorem 4.1 

The following proof resembles the standard proof in [23], however we will 
stress the explicit interaction between the constants in the decay condition 
of the symbol (2.3) and the operator norm in (4.1). 

For the proof of theorem 4.1 we need the following basic inequalities; 

- Young’s inequality; 

< ||/||li'(R») ||ff||L«(R"), (A.l) 

where f * g is the convolution between functions f £ LP and g £ . 

— Peetre’s inequality; 

(l + |xH‘<(l + |x-2/|)'l*l(l + |2/p)‘, (A.2) 

where t is any real number and x,y . 

Furthermore we shall need 

(1 + \x\^y < (l-b < 2^ (1 + \x\‘^y for t > 0 (A.3) 

and 

2‘ (1 + |a;p)‘ < (1 + |x|)^‘ < (1 + |x|2)* for t < 0. (A.4) 

It is necessary to show that there holds: 

\\Bu\\H^-m(^r) < CiQ^ko;0'M^s\_2i^(6)-M^s_^\_2kQ{S)-\\u\\H^(r) Vu G (7^(r), 

(A.5) 

where 6 is the Dircac measure at the origin. We denote by p the Fourier 
transform of the symbol p{x,y,^) with respect to x and to y: 

I I e-'^^^’p{x,y,Odxdy (A.6) 

X X 

In the first step we prove an estimate for the derivatives D^^p{^\r]' With 
(1 - /!*)'’» = (1 + l^'l^)*” 

and 

(1 - = (1 + |jjf )^“ 

there follows by integration by parts 

= (27t)-" (1 + l^f )■*“ (1 + ■ 

. I le-^-"Ul-AyYoil-A,)'^‘>pix,y,Odxdy. (A.7) 

X X 
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This gives 



< c(27t)-” (1 + (1 + IVP)- 



■io 



sup 

x,yeX 






ko 



The Leibniz formula 



(1 - AY° g{x) = (-l)M^g(x) 

U VJ/ 



implice 



D^^l-AyYo{l-A,)’^<^p{x,y,o\< 

EE ('“){>) I°?44p(x.»,«)|. 

i =0 j =0 ^ \ J / 

Hence, we obtain by the definition of the constant Ci^^ko^a^ (4.3) 



D^m-AyY^{l-A,top{x,y,0 

Using (A. 4) we end up with the estimate 



^ ^io,ko,as 



D^^p{^',ri',0\ < c(7,„,fc„;„3(27r)-" (1 + 

(A.9) 

In the next step we show 

(B'um') = (27t)-” 1 1 p{^' - ^ - p',Ouiv') d^drj'. (A.IO) 

RnRn 

We begin with the representation (2.2). Using the Fourier transformation we 
obtain 

{Bu){^') = J J j ed^~^'>^^p{x,y,^)u{y)dyd4dx 

X R» X 

= {2-k)~^'^ j j j e~'^^ ~^^^"^p{x,y,i)dxu{y)dyd^ 



(1 + ICI) 



\-Uo, 



m— laal 



X 



= (27t) "" j j ^ -^,y,Ou{y)dyd^, 

R" X 

where p denotes the Fourier transformed of p with respect to x. Using the 
inverse Fourier transformation, this implies 
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= (27t) j y e -C,y,Odyu{v')d^dr)'. 

RnRn X 

The integral with respect to y is the Fourier transformed of p with respect 
to y. This concludes the proof of formula (A. 10). 

From (A. 10), Peetre’s inequality (A.2) and the inequalities (A.3) and 
(A. 4) there follows 

( 1 + lef |(^)(^')| < (27r)-2«y y ( 1 + 1 ^ - • 

RnRn 

• (1 + |p(^' - - v,OI \Hv')\ d^dv' 



<c{s,m){2ir)-^- j j (1 + |^ - ' • 

RnRn 

• (1 + (1 + lei)-'" m' -u- i«(V)i «• 

Applying Peetre’s inequality again yields 
(1 + l^f \{^u)ie)\ < c(s,m)(27r)-"” j + ’ 

RnRn 

. (1 + 1^ - VI)'*' (1 + 1^1)-'" (1 + |r?T)*^' - v',0\ l«(V)l «• 

Using the decay condition (A.9) of p(-, •, •) (with as = 0) we obtain 



(1 + ri) 



(s—m)/2 



(B'u)ia\<c{s,m)i2ir)-^-CtMj y (1 + 1^ - VI)'*' 

Rn Rn 

(A.ll) 

(1 + 1^ - Vl)'*-'"'-^'" (1 + iVn*^' l«(»?')l d^drj'. 



The right-hand side of (A.ll) can be written as a three-fold convolution of 
three functions hi , hz and hs : 

{hs * /i2 * hi) (^') = j (h2 * hi) (^) • hs{^' - 0 

Rn 

= 11 h2{^-r,')-h3i^' -Od^hi{ri')dv', 

RnRn 

Using Young’s inequality (A.l) with r = 2, p = 1 and q = 2 two times gives 
llha * h2 * hi||L2(R«) < ||h3||Li(R«) • ||h2||Li(R«) • ||hl||L2(R~). (A. 12) 

With hi(-) := (l + I • |^i(*)|j h2(0 := (1 + | • |)'^^ and hs{-) := 

(1 + I • we obtain from (A.ll) and (A. 12) 
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||(1 + |. 12 )(— 

< c(s,m)(27r)“''^”Cf(,,fco;o|l/i3|U'(R’‘)||^2||L'(R’')ll (^ + I ' 1'^) ^ ^(OIU^CR")- 

By the definition of the Sobolev norm this is equivalent to 
llBu||ff.-m(Rn) < c(s,m)(27r)“^"C'<o,fco;0 • ||/i3||Li(R’*)||h2|Ui(R>>)||n||H*(R»). 

(A.13) 

Using 

||h2|UMR’>) = / (1 + 1^1)'*''''“ = (27 t)"M|*|_2,„(5) 

Rn 

for 2£o > n -f- |s| and 

Rn 

for 2A:o > n + |s - mj, we obtain from (A.13) the estimate for the operator 
norm in (4.1). □ 
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Domain Decomposition and Preconditioning 
Techniques in Boundary Element Methods 

0. Steinbach and W. L. Wendland 



1 Introduction 

Domain decomposition methods are powerful tools to parallelize the imple- 
mentation of boundary element methods, especially if different material pa- 
rameters in substructures appear. Moreover, such an approach is well suited 
for the coupling with other discretization schemes such as finite element meth- 
ods; and for problems in infinite regions. 

Boundary element methods are based on equivalent boundary integral 
equations of corresponding mixed boundary value problems. We restrict our- 
selves to the direct approach for selfadjoint problems of second order, where 
the mapping properties of the resulting boundary integral operators are well 
known [6, 7]. The basic tool for the boundary element domain decomposition 
formulation is the Calderon projection in terms of two boundary integral 
equations from which we can derive two representations of the Dirichlet- 
Neumann map based on the so-called Steklov-Poincare operator. According 
to the choice of these operators, we distinguish between the symmetric and 
the nonsymmetric formulation. The latter is based on an integral equation 
involving only single and double layer potentials, which allows us to use ei- 
ther Galerkin or collocation methods for their discretization. This approach 
is used for coupling finite and boundary element methods in a nonuniform 
way, see e.g. [14, 24]. 

The symmetric formulation is based on both of the boundary integral 
equations given by the Calderon projection [17] and is widely used in sym- 
metric coupling of finite and boundary element methods [5, 8, 9]. 

Both formulations lead to algebraic systems of linear equations of the 
same type; however, in the nonsymmetric formulation, the resulting system 
is symmetric only up to a small perturbation. For the solution of these systems 
one can use different iterative schemes like Schur complement algorithms or 
transformation-based solvers like the Bramble /Pasciak approach for skew- 
symmetric positive definite systems [1, 8], which can be generalized to the 
nonsymmetric case, too. However, these methods require the use of efficient 
preconditioners to keep the number of iterations small which are needed for 
a given relative error reduction independent of the discretization parameters. 
Such optimal preconditioners can be classified into two classes: the local type 
preconditioners for the single layer potential operator to realize the action 
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of the local Steklov-Poincare operators, and of global type for the assem- 
bled Schur complement, which corresponds to the globally defined discrete 
Dirichlet-Neumann map. 

Preconditioners for discretized boundary integral operators are well known 
in the two-dimensional case for uniform meshes, where one can use circulant 
or Toeplitz matrices approximating the boundary integral operators on a cir- 
cle [13], or one may use fast realizations for the Steklov-Poincare operator 
and its inverse by solving finite element schemes for the underlying partial 
differential equation on uniform meshes in the domain. For both methods 
there are generalizations to the 3D-case [10, 11], which are restricted to par- 
tially uniform meshes and a suitable ordering of the geometrical nodes and 
elements. 

Here we will give a general approach for the preconditioning of discrete 
boundary integral operators, which is well suited for the symmetric formula- 
tion and which has no restrictions to the space dimension and the discretiza- 
tion scheme used [22]. 

The construction of global preconditioners for the discrete Steklov-Poin- 
care operator in its symmetric representation is based on the spectral equi- 
valence to the Galerkin discretization of the hypersingular integral operator. 
This allows us to construct several preconditioners of hierarchical type, where 
the spectral condition number depends on the relation between the local and 
the diameter of the substructures. Due to the spectral equivalence described 
above, we may also take the discrete hypersingular integral operator as an op- 
timal preconditioner of the global Schur complement system. The realization 
of this preconditioner, i.e. the inversion of the discrete hypersingular opera- 
tor can be obtained either by multigrid schemes or any other preconditioned 
iterative methods. 

Based on these preconditioners we construct fast algorithms for the itera- 
tive solution of linear algebraic systems resulting from mixed boundary value 
problems in a variational formulation with respect to a given domain decom- 
position. Numerical examples will demonstrate the efficiency of the proposed 
methods. 



2 Variational Formulations for Mixed Boundary Value 
Problems 



Let us consider a selfadjoint and elliptic boundary value problem of 2nd 
order in a bounded domain i? C with n = 2, 3 and a Lipschitz continuous 
boundary JT = 9i7, which is decomposed into two distinct parts Fd and /V, 
where boundary conditions of Dirichlet or Neumann type, respectively, are 



given: 



L{x)u{x) 


= fix) 


for X £ 1 






= 9(x) 


for X £ Fd, > 


(2.1) 


£iu{x) 


= h{x) 


for X £ F]sf- J 
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Here, the differential operator is given by 



L{x)u{x) — — 




-h c{x)u{x ) , 



( 2 . 2 ) 



whereas the £j are the corresponding boundary operators with Cqu 



ditional trace and 






the tra- 



the conormal derivative with v= (z/i , . . . , v^) the exterior unit normal vector 
on r. E.g. in linear elasticity are the boundary displacements while t\u 
represents the boundary stress. 

For a nonoverlapping domain decomposition 



p 

i7 = Qi O Qj for i ^ j, Fi := 917^, Fij := Fi fl Fj (2.3) 

i=l 

with the skeleton ^ 

rs := U Fi (2.4) 

i=l 

we suppose, that the coefficients of the differential operator (2.2) are piecewise 
constant: 

— (Ae » X £ Qi. 

Hence, the fundamental solutions of the differential operators exist locally, 
i.e. for alH = 1, . . . ,p there exist functions E^{x,y) satisfying 

L{x)E\x, y) = S(x ~ y) • l{x) for x, y € i7i . 

As in most applications, we choose the local trace operators FjU{-) for j = 0, 1 
and i = 1, ... ,p as 

Ui{x) = Fqu{x), U{x) = £\u{x) for x £ Fi. 

Then the solution u{-) of the mixed boundary value problem (2.1) is given 
by the local representation formulae 

it(a:) = j ti(y) ■ llE\x,y)dsy - j Ui(y) ■ £\E*{x,y)dsy + J f{y) ■ E\x,y)dy 

n Fi Oi 



for X £ Qi and i = 1, . . . ,p. In addition to the boundary conditions 



^i(^) = 9{^) for X £Fi£\FD, 
ti{x) = h{x) for X £ FiC\ Fn, 



the densities U{ and U have to satisfy homogeneous transmission conditions 
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Ui{x) - Uj(x) = 0, ti{x) + tj{x) = 0 for a: G , (2.5) 

which may be generalized to inhomogeneous conditions, too. Associated with 
the fundamental solutions £*(•, •), we define the common boundary integral 
operators by 

{Viti){x) = j {ei{y)E\x,y)) U{y) dsy , 
r. 

{KiUi){x) = J {i\{y)E\x,y))^ Ui{y) dsy , 

Fi 

{K'iti){x) = j {e\{x)E^(x,y)) ti{y) dsy , 

Fi 

{DiUi){x) = ~e\{x) J {i\{y)E\x,y))^ Ui{y)dsy 
r» 

for X e Fi and the boundary traces of the volume or Newton potentials by 
(N^f){x) = tj j E'{x, y)f(y) dy for x £ Ei and j = 0, 1. 

Qi 



In terms of the Calderon projector the resulting local boundary integral equa- 
tions can be written as 






- Ki Vi 

Di hl + K, 






( 2 . 6 ) 



^OT i = 1, . . . ,p. These equations imply the nonsymmetric representation of 
the local Dirichlet-Neumann maps 



U = Vr^ (^i + K^Ui-Vr^Nlf 

as well as their symmetric representation 

ti = [{ll + K()V-^{^I + Ki)+Di]ui 

+mf-{^I + Ki)Vr^N*f. 

This may be written as 

ti — SiXLi fi 

by using the Steklov-Poincare operator 

SiUi = vr' + Di 



(2.7) 

( 2 . 8 ) 
(2.9) 



and 
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fi = Vr^Nlf = Q/ + K'^ Vr^Nif - Nif . 

All the boundary integral operators involved are pseudodifferential operators 
of orders - 1, 0, +1, respectively, and their mapping properties are well known, 
c.f. for instance [4, 6, 7]. 



2.1 The nonsymmetric formulation 



The variational formulation of the mixed boundary value problem (2.1) with 
respect to the domain decomposition (2.3) satisfying the transmission condi- 
tions (2.5) has the following form: 

Find u E with ioU\rD ~ 9^ 



p P p 

'£,{SAu,ei = / h-iov\rNdsx + ( 2 . 10 ) 

*=i ri »=i 



for all V e with — 0. 

Here, the continuity of u across the coupling boundaries rij is satisfied in the 
strong 77^(i7)-sense, while the compatibility conditions of the stresses U are 
required in a weak form. This concept corresponds to variational formulations 
for finite element domain decomposition methods and is therefore also well 
suited for coupled finite-boundary element methods [8]. 

In the contrary to (2.10), one can use the inverse Steklov-Poincare opera- 
tors to require continuity of the stresses U and a weak coupling of the Ui [14]; 
generally one may take the weak coupling of both components, this would 
lead to the so-called mortar element methods. 

Due to the i/^/‘^(r 5 )-ellipticity of the sum of the Steklov-Poincare op- 
erators involved in (2.10), the variational problem has a unique solution. 
Moreover, for a finite-dimensional subspace 

Vh = span{^^}f=i C H^/\rs) 

of spline functions of polynomial degree /i we have the quasi-optimal error 
estimate by Ceas lemma, 

for the solution uh of the variational problem (2.10) where inf is taken 
over the subspace Vh with corresponding boundary conditions. However, the 
Steklov-Poincare operators involved admit no explicit discretization. Hence 
we introduce approximations of the local Steklov-Poincare operators Si by 

S^eivH ■■= tiewl = spanK.jfli C (2.11) 

satisfying the local finite-dimensional variational boundary integral equations 
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{Vitl4)L-{n) = + ( 2 . 12 ) 

for all tI £ Wl C This approach includes Galerkin- as well as 

collocation schemes depending on the choice of the test spaces to compute 

the approximate Steklov-Poincare operators S^. If we suppose quasi-optimal 
error estimates (see e.g. [16]) for the local problems (2.12), i.e. 

||(5f - 5i)£o^i/||/^_i/2(/-.) < C' •\\SitQUH\\w{r) (2.13) 

with — ^ < t and t | for n = 2, t < // for n = 3, then we can 

derive the following existence and convergence results for the solution of the 
approximate variational problem: 

Theorem 2.1. [14, 24] Let h,H be the maximal diameters of the finite ele- 
ments in Vh and Wh respectively. If the condition h < C • H is satisfied with 
an appropriate constant C depending only on the ellipticity and boundedness 
of the single layer potential operators V{ in (2.12) and on the constant in 
the inverse inequality for Vh, then the sum of the operators is positive 
definite and bounded. Hence, there exists a unique solution uh ^ Vh with 
^qUhifd = 9 satisfying the variational problem 

p 

= Hvh) (2.14) 

i=l 

for all test functions vh € Vh with jo^HirD ~ 0 have the error 

estimate 

W'^ ~ '^hWh^/^^Fs) ^ ~ + IK'5' - *5^)w||H-i/2(r5)} • 

(2.15) 

The proof is due to the first Strang lemma (c.f. e.g. [3]) based on the local 
error estimates (2.13) and uses the inverse property of the trial space Vh- 
The discretization, either by a Galerkin or collocation method, leads now 
to the system of linear equations 

Ml (I + = /, (2.16) 



with Mji = Mh in the Galerkin-Bubnov case. This is algebraically equivalent 
to the coupled system 



( -\Mh 

(m- 0 jUj-Ur 



(2.17) 



note, that (2.16) is the Schur complement system of (2.17). 
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2.2 The symmetric formulation 



Instead of the approximations of the local Steklov-Poincare operators Si one 
may use the symmetric formulation [5, 17] to discretize the original variational 
problem (2.10). This approach is based on both boundary integral equations 
in (2.6) and leads to the following variational formulation [9]. 

Find u E with iou\ro U ^ with = h for all 

i — 1, . . . such that 



|(^^o^j^o^)L2(ro d- ’^{ti,io'^)L^(ri) + = f‘2{v) 

{Viti,Ti)L2(^n) - n)L^(ri) - {K^do'^^Ti)L^(ri) = fiin) 

holds for all test functions v G with ^ov\ro ~ 0 ^ 

with Ti\rN = 0 /or i = 1, . . . ,p. 

Here, the linear forms on the right hand side are given by 



f 2 (v) = / h{x) ■ iov{x) dsx - Y^{Nlf,tov)L 2 (ri) 

Fn 



fiin) = -(iV*/,r<}^.(r,) . 



Since the associated bilinear form is positive definite and bounded, the varia- 
tional problem is uniquely solvable and we get quasi-optimal error estimates 
for the corresponding solution in finite-dimensional subspaces without ad- 
ditional conditions. The Galerkin discretization of the variational problem 
leads to the skew-symmetric system 







(2.18) 



where is the Galerkin matrix which couples the unknown Dirichlet 

datum on ZV with the unknown Neumann datum on To while Kh = \^hr^^h 
is the Galerkin matrix for the coupling boundaries Fc Fs\r. 

Applying a block Gauss-Seidel step to (2.18) we get the Schur complement 
system with the matrix 



Dft + 2L=l,- = I 



(2.19) 



which is symmetric and positive definite. Hence we can use the preconditioned 
method of conjugate gradients to solve (2.19) iteratively. In general, we have 
to assume, that there are two preconditioners available, i.e. matrices Cy and 
Cs satisfying the spectral equivalence inequalities 



cY • (CvLt) < {VhLf) < • (Cvt.t) 
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for all vectors t associated with the (local) trial functions 4 and 
cf • (Csu,u) < {{Dh + klV^^kh)u,u) < 4 • {Csu,u) 

for all vectors u associated with the (global) trial functions Uh, where all 
constants should not depend on the actual discretization parameters. We 
will give a general approach to construct such preconditioners in chapter 3. 
For the construction of preconditioners to the discrete symmetric Steklov- 
Poincare operator 

§„ = D^ + kJV^^kh, ( 2 . 20 ) 

which is only an approximation of the Galerkin discretization of the Steklov- 
Poincare operator 5* defined in (2.9) given by the matrix entries 

Sh[e,k] = (2.21) 

i=l 

for appropriate trial functions of degree fx, we invoke the following basic 
result. 

Theorem 2.2. [22] The matrices defined in (2.20) and (2.21) are spectrally 
equivalent, i.e. we have the inequalities 

-{ShU^U) < {ShU,u) < C2'{ShU,u) 

for all vectors u associated with trial functions Uh £Vh, where the constants 
do not depend on the discretization. 

Therefore it is only necessary to construct preconditioners for the single 
layer potentials V/^, and either for the Steklov-Poincare operator Sh or for 
Dh, respectively, which are Galerkin approximations to pseudodifferential 
operators of order ±1. Obviously, there is a strong relation between these 
operators, which we will use for the construction of efficient preconditioning 
techniques in boundary element methods. 



3 The Construction of Local Preconditioners 

Let us consider strongly elliptic and self-adjoint pseudodifferential operators 
A, B of orders ±2a, respectively, in appropriate Sobolev spaces on F: 

A : H%F) ^ i/"-2^(r), B : H^-‘^^{F) H\F ) . 

With the admissible quotient spaces 

= «*(n/kerX . W' = 

we assume the coerciveness and boundedness inequalities 
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‘ IKIlH^(r) — ^ Vt G y , 

cf • ||'«||H>-2»(r) < (Su,u)H,-«(r) < cf ■ l|u||H.-2„(r) Vu G W. 

For the nontrivial finite-dimensional nullspace kerS we use an orthonomal 
basis representation 

ker5 = {v€ : Bv = 0) = span{t;fc}^=i 

with 

{'^k,vi)H-- 2 c(^r) = hi for all 
Hence we can introduce the quotient space 

G H%r) : {t,Vk)H‘-Hn = O for allt;fc G kerS} ; 
and the pseudoinverse operator 

^-1 . ^ W 

is then well defined, i.e. B~^ is positive definite and bounded for all 
t € Thus we have immediately 

Lemma 3.1. For every t £V C\ V^^^{B), the inequalities 

ji ’ {B~^t,t)H^-oc(r) < < 12 ' {B~^t,t)H^~o^{r) 

are satisfied with positive constants 71 = • cf and 71 = c.f • cf . 

To construct a preconditioner we have to split the space V into an orthogonal 
sum 

y = ^ . (3.1) 

To this end, we characterize the Bessel potential operator 

J : H^{F) 

for u G by the variational formulation 

{Ju,v)H“-^{r) = for all G 

Then, for any t EV there exists the uniquely determined projection 

m 

to{x) = t{x) - '^{t,Vj)H-‘-(r){Jvj){x) G V^’°{B). 
j=i 

Now we can formulate one of the basic results for our preconditioning tech- 
nique [22]: 
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Theorem 3.1. The bounded linear operator defined by the bilinear form 

m 

«(r) “1" «(r) (3.2) 

A;=l 

for all t^r £V is spectrally equivalent to A, i.e. there hold the inequalities 
7 i • c{t,t) < {At,t)n^-c.(^r>^ < 72 • c{t,t) for all t eV 
with positive constants 

7i = min{l, cf }, 72 = • max{l, cf }. 

Let a finite approximation space Vh CV he given by 

Vh = span{(^^(0}f=i, 

e.g. by the smoothest B-splines ^^i') piece-wise polynomial degree u. Then 
as a direct consequence of Theorem 3.1, the matrices A^ and Ch defined by 
their elements 

Ah[i,k] = (Aip''k,(pf)H-<‘{r) &ndCh[i,k] = c{(pl,(pf) for k,i = 1, . . . ,N 

are spectrally equivalent, i.e. the spectral condition number of the precondi- 
tioned system C^^A^ is bounded by a constant independent of the discretiza- 
tion parameters like mesh adaptivity or the degree of the trial functions used. 
To use the proposed preconditioner, one has to discretize the bilinear form 
(3.2) and to realize the multiplication with in the most efficient way. 

For the orthogonal complement to we suppose a representation 

in terms of the above basis of V/^, i.e. 

N 

{Jvk){x) = 

J=1 

By the partial Gram-Schmidt qrthogonalization, 

m 

(fl’^ix) = ipkix) - ■ {Jvj){x) € 

i=i 

we find the discrete analog to (3.1) by the basis transformation 
Vh = span{y>J',...,y)5;f} = 

For a given function th € this may be written as 

N N—m m 

TW = + '^tN-m+k ■ (JVkfix) 

k=l k=l k=l 



with the relations 
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tk = Oik + 

e=i 



N—m 

tN-m-i-i - ^2 ij 

i=l 



for A: = 1, . . . ,iV and 




for A: = 1, ... ,7V - m, 
for A; = TV - m -f 1, . . . , TV. 



It can be shown [19], that this basis transformation is transposed to a trans- 
formation T defined as follows: Let the Galerkin weights fk = fi^l) ^ 
linear form /(•) be given. Using the transformed trial functions we find 

N 

fN-m+i ■■= f{Jvi) = V(j ■ fj for £ = 1, . . . , m and 
j=i 



m 

fj •= = fj - «(r) • fN-m+l 

t=0 



for j = 1, . . . , TV - m ; in short / = T/. 

Due to the orthogonal decomposition of the trial space 14, the Galerkin 
discretization of the bilinear form (3.2) with respect to the transformed trial 
functions leads to the block diagonal matrix 



Ch 




with the matrix elements 

Co[i,k] = ioxk,l=l,...,N -m. 

We still have to compute these Galerkin weights, where the pseudoinverse 
operator B~^ is in general not given in explicit form. Hence we have to find 
an approximation u\ of the exact function uq = ^ W ior any function 

to € This can be done by solving the finite-dimensional variational 

problem 

{BuQ,VQ)H—<^{r) - «(r) 

for all test functions Vq in an appropriate subspace C W 2 is follows. 

For a finite-dimensional subspace Wh C with the basis repre- 

sentation 

Wk = spanK(.)}f=i, 
we define an orthogonal decomposition 

Wh = < © {t;fc}r=i with C W 

with the transformed trial functions 
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• Mx) . 

1=1 

Then the discretization of the pseudoinverse operator B~^ leads to the matrix 
representation 

Co = MhB^^M;[ 

with the entries 



Bh\j,i] = {B<pf,ip^)H-<>ir), 

m 

^h[j,k] = {<Pk,<fj)H‘-o(r) - 2«(r) 

e=i 



for i,j = 1,. M — m,k = I, ... ,N — m. Prom the additional condition 



c 






< 



sup 



°(r)| 

IKllH'-=»(r) 



for all th € 



(3.3) 



then immediatly follows the spectral equivalence of Cq and Cq. On the other 
hand, an efBcient preconditioner requires the action of which in this 
case can easily be done by 

Co-V= M-^BhM;^ 

if we suppose the invertibility of which corresponds to (3.3) with N = M. 

Due to the batsis transformation introduced above, we have: 

Lemma 3.2. The modified mass matrix Mh invertible if and only if the 
mass matrix defined by 



Mh[i,k] = 

for k^t= 1, . . . , iV is invertible. 

For the special choices u = n 2k with k £ /No , the mass matrix Mh is 
strongly diagonal dominant, hence there exists the inverse independent of h] 
and condition (3.3) is satisfied. Then the action of the proposed precondi- 
tioner is given by 






M, 



-T] 



h 

0 



-1 



T. 



To demonstrate this technique, let us consider the Laplace equation, where 
A is the single layer potential operator of order 2a = — 1 given by 

(Vt)ix) = j E{x,y)t{y)dsy 
r 
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For = V -\-2k with k E /Nq the modified mass matrix is given by 

Mh = 

where the inverse can be computed from the Sherman-Morrison formula [12], 
with 

a = . 

Note, that Mh is sparse and strongly diagonally dominant, hence the inverse 
can be computed by applying some steps of the Jacobi iteration to get a spec- 
trally equivalent approximation of Moreover, for n = 2 and i/ = = 1, 

the mass matrix Mh is tridiagonal due to the basis transformation. Then, 
can be computed directly in optimal order by a Cholesky decomposi- 
tion. 

However, the basis transformation used may produce some difficulties in 
applying these techniques to more general problems, like in linear elasticity. 




484 O. Steinbax:h, W. L. Wendland 



where we have up to 6 eigensolutions of the hypersingular integral operator. 
To avoid these complications, we modify our preconditioner in the following 
way. Since the hypersingular integral operator associated with the Laplacian 
is positive definite and bounded with respect to the | • |/^i/ 2 (/^)-semi-norm, 
i.e. 



{DujU)L^(^r) ^ ^2 ' W\H^/2(r) Vw € (3.4) 

then it follows immediately, that the modified operator 



B = a - 1 D 



is positive definite and bounded with respect to the norm in For 

a £ [cf , c^] we have the same constants as in (3.4). In this approach, no ba- 
sis transformation is needed, which clearly can be extended to more general 
problems without any difficulty. Note, that for more general equations, the 
above modification only depends on the nontrivial nullspace of the hyper- 
singular integral operator D [22]. The amount of work is of the same order 
as the multiplication with the original matrix where in the symmetric 
approach to mixed boundary value problems and in domain decomposition 
methods both operators are still discretized and therefore available. 

In general, this technique can be used to construct efficient precondition- 
ers for boundary integral operators, if pairs of operators of order ±2a are 
available. There are no restrictions to the space dimension. Moreover, this 
method is also applicable, if F is not a closed boundary curve, as it appears 
in mixed boundary value problems. 

At the end of this chapter let us present some numerical examples. For 
an L-shaped domain without further decomposition we consider the Laplace 
equation with Dirichlet boundary conditions to check both preconditioners 
proposed for the single layer potential. The Galerkin discretization is done for 
z/ = /i = 1 and an uniform mesh, N denotes the number of degrees of freedom, 
K,{C^^Vh) is an approximate bound for the spectral condition number and 
Iter and sec mean the number of iterations and the time needed, respectively, 
in order to achieve a relative error reduction of 10“® in the preconditioned 
conjugate gradient iteration. 

Table 3.1. Preconditioners for Vh. 









WSMESsSMUM 


N 




Iter 


sec 




Iter 


sec 


64 




8 




1.85 


8 


0.01 




■Bl 


8 




1.92 


8 


0.04 






8 




1.99 


9 


0.21 


512 


■BH 


8 


1.42 




9 


0.96 


1024 


BIB 


8 


6.42 


2.09 


9 


3,80 


2048 




8 


24.18 


2.12 


9 


15.18 
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From the numerical results given in Table 3.1 we conclude experimen- 
tally the optimality of the preconditioners proposed. As we can see, the batsis 
transformation guarantees a sharper estimate of the spectral condition num- 
ber, however, the numerical amount of work of the modified preconditioner 
without this basis transformation is significantly less. 

Obviously, we can use this technique in the opposite direction to get a 
preconditioner for the discrete hypersingular integral operator and, due to 
Theorem 2.2, also for the discrete Steklov-Poincare operator. Table 3.2 shows 
the numerical results. 



Table 3.2. Preconditioner for Dh- 



N 




KiC^Dn) 


Iter 


sec 


64 


1.77 


9 


0.01 


128 


1.80 


9 


0.04 


256 


1.82 


9 


0.19 


512 


1.82 


9 


0.93 


1024 


1.83 


9 


3.63 


2048 


1.83 


9 


14.50 



Finally, we conclude this section with the statement, that the proposed 
preconditioning technique is optimal for the symmetric formulation of bound- 
ary integral equations to handle mixed boundary value problems by domain 
decomposition methods. For u = fi, all boundary integral operators needed 
for preconditioning are discretized anyway, so this method seems to be the 
most natural one. Moreover, there are no restrictions to the geometrical dis- 
cretization and to the underlying partial differential equation either. 



4 Hierarchical Domain Decomposition Preconditioners 

Due to Theorem 2.2 it is sufficient to construct a spectrally equivalent pre- 
conditioner for the Galerkin discretization of the Steklov-Poincare operator 
given by the bilinear form 

p 

a{u,v) = '^(Sifo'u,4v}L^(ri) (4.1) 

i=l 

for UjV £ To this end we recall, that a boundary element domain 

decomposition method can be considered as a parallel realization of a Trefftz 
formulation. Hence it seems to be natural, to decompose any given function 
u G by 

u(x) = uh(^) -I- u(x) for X £ Fs (4.2) 

into a coarse grid part uh{’) and a fine grid function u(-), where the latter 
vanishes at all global coarse grid nodes xc € cj. Here, uj is the g-dimensional 
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set of all coarse grid nodes of the given domain decomposition (2.3). Obvi- 
ously, this decomposition is only unique, if we choose the ansatz 

Q 

uh{x) = Y^u{xc,k)-^ki^): (4-3) 

k=l 

where are harmonic trial functions, which are piecewise polynomials 

on Fs and satisfy the given partial differential equation with homogeneous 
right hand side in the subdomains 
If we define by 

Vh(I^s) = span{v?f}*^j 

the trial space of the coarse grid functions, we can define a unique basis 
transformation 

T :Vk{rs)-^vUrs)^v^{rs), 

of the usual complete trial space Vh{Fs) onto a hierarchical trial space. If we 
denote by Ih the piecewise linear interpolant of a coarse grid function, this 
basis transformation is given by 



T = 





(4.4) 



provided a corresponding ordering of the coefficients u of the original fine 
grid function Uh- 

The Galerkin discretization of the variational problem (2.10) with respect 
to the hierarchical trial space now leads to the reordered algebraic system 
with the stiffness matrix 



S hierarchical 



Shh ShH 



(4.5) 



Prom this representation we derive different preconditioners for the globally 
defined discrete Steklov-Poincare operator based on the relation 



Sh — T • Shierarchical ’ T . 



(4.6) 



If we are given a hierarchical preconditioner C hierarchical which is spectrally 
equivalent to (4.5), the resulting preconditioner for the discrete Steklov- 
Poincare operator is given by 

~T 



^—1 7 ^- 

^ hierarchical 



Cs = T-i 

where we use the inverse basis transformation matrix 



(4.7) 



1 



h 



The construction of hierarchical preconditioners to (4.5) is based on the fol- 
lowing result [19]: 
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Theorem 4.1. Let Vh € VhiPs) be an arbitrary, given function. The cor- 
responding coarse grid function vjj is obviously the linaer interpolant with 
respect to the coarse grid, i.e. vh = Ih^h- Then there holds the stability 
estimate 

W'^h - VH\\m/^{rs) - ^(^5^) ' ll^/i|lHi/2(r5) (4.8) 

with the positive constant 

K{h^H) = • (log 7)”“^ withj = . (4.9) 

n 

The proof of Theorem 4.1 is based on the Sobolev embedding theorem, 

|u(x) - u{y)f < c • /I'f x,y £ T*, hk = diam/i* 
for < s < n — 1 (n = 2, 3) with the constant 

L{s) = {2s-n-\-iy-^ 

describing explicitly the dependence on s and n. The above estimate implies 
the error estimate for the linear interpolation function uj, 

\\u - ^ ^ ‘ ‘ • ll^lfH-(r) for 0 < t < 1, t < 5; 

and combining this estimate with the inverse inequality for Vh{Ts) provides 
the inequality (4.8) with (4.9) in Theorem 4.1. 

As a consequence of this theorem we conclude estimates of the spectral 
condition number of the preconditioned system by using different precondi- 
tioning techniques. Here we will summarize the results, for a more detailed 
representation we refer to a forthcoming paper. 

Neglecting the coupling terms in (4.5) we get the BPS-type preconditioner 



Cbps = 



Shh 0 \ 

0 Shh J 



which gives the estimate 

< ci-[l + K(h,H)}. 

For a preconditioner of (4.5) which maintains the coupling, we factorize the 
block matrix (4.5) and choose 



CcGS = 



nfT 0-1 




Shh 0 
0 Shh 

as preconditioner which provides the estimate 

i^iCchsSh) < oi-K{h,H). 
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Both preconditioners need a preconditioning matrix Chh which is spectrally 
equivalent to Shh- For this preconditioning one may use different approaches 
such as the ones based on the results in chapter 3 or of Neumann-Neumann 
type [23]. In any case, however, the spectral condition number of the pre- 
conditioned system will depend of the ratio H/h^ which ist not optimal. To 
avoid this dependence, we note that we have the optimal bound 

K(Df^^Sh) < cs 

due to Theorem 2.2. So we can use the explicitly given matrix Dh as a precon- 
ditioner of the implicitly given Schur complement matrix Sh- To realize the 
action of this preconditioner, i.e. to multiply a given vector v by , one can 
use either a linear multigrid scheme [8] or any other iterative scheme precon- 
ditioned by the hierarchical methods described above. Here we will compare 
all three preconditioners for a simple model problem with the Laplacian for 
a domain decomposition of the unit square into 64 subdomains. N denotes 
the local degree of freedom in the subdomains, while the iteration number 
and the time correspond to an error reduction of 10“^: 



Table 4.1. Preconditioners for 5/^. 





BPS 


CGS 


CG(Dh) 1 


N 


Iter 


sec 


Iter 


sec 


Iter 


sec 


64 


24 


22.52 


20 


26.95 


11 


41.28 


128 


24 


26.55 


22 


33.20 


12 


47.01 


256 


25 


45.47 


23 


52.09 


13 


62.11 


512 


27 


127.39 


25 


133.02 


13 


107.80 


1024 


28 


489.38 


26 


482.16 


14 


326.77 



Note, that the number N of 1024 local degrees of freedom per subdomain 
corresponds for 64 subdomains to a global number of 4.190.209 inner finite 
element nodes. The computations documented in Table 4.1, are carried out 
on an Intel Paragon machine. 

Comparing all three preconditioners presented, we conclude, that the ap- 
plication of by a hierarchically preconditioned conjugate gradient scheme 
is cisymptotically optimal due to the bounded number of global iterations. 
Similar results were obtained in the case of linear elasticity in [19]. 



5 Conclusions 

The Galerkin discretization of the symmetric boundary element domain de- 
composition formulation leads to a skew-symmetric, positive definite system 
of linear equations, where the Schur complement is positive definite and sym- 
metric. Both local and global preconditioners for the discrete single layer po- 
tential operators and the discrete Steklov-Poincare operators, respectively, 
may be constructed by related boundary integral operators without any ad- 
ditional work. Moreover, the construction of a preconditioner corresponding 
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to the global Schur complement system can be reduced to the discrete hy- 
persingular integral operator, which is given explicitly in the contrary to the 
nonsymmetric boundary element formulation as well as to the finite element 
domain decomposition methods. 
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Benchmark Problems for Boundary Element 
Methods 

H. Schulz, 0. Steinbach, and W. L. Wendland 



1 Introduction 

Over the last 6 years a lot of new techniques and algorithms for boundary 
element methods have been developed; e.g. for the numerical integration of 
singular integrals, for fast matrix times vector realizations as the panel clus- 
tering method or wavelet techniques to get sparse stiffness matrices; for do- 
main decomposition methods and parallel algorithms for boundary integral 
equations including preconditioning techniques; for applications to compli- 
cated three dimensional nonlinear problems in structural mechanics. 

Related to the theoretical results, computer codes have been developed by 
different groups, partially in cooperation, to incorporate, confirm and exploit 
the theory. Due to the analytically involved formulation of boundary element 
methods, we could not expect to generate an optimal code for solving general 
boundary value problems by the BEM. Instead, the current goal is now to 
compare particular components in order to find an optimal strategy for the 
future design and development of boundary element software. 

Such a comparison via benchmark problems should be done with respect 
to various criteria. With our first proposal we started a significant discussion 
on the design of benchmark problems, i.e. the problems and criteria. At this 
stage, only a qualitative comparison can be expected. The ultimate aim will 
be a quantitative comparison with respect to the computational costs for 
results within a given tolerance. Benchmark problems should be simple on 
one hand, but they should also contain typical features of real applications. 
As model problems we have to include the Laplace equation and problems 
in linear elasticity in two and three dimensions. A crucial task is the numer- 
ical integration of the singular surface integrals for the discretization of the 
boundary integral operators, i.e. the amount of work for the corresponding 
computer realization. In general, this has to be related to the accuracy of 
the numerical solution, so we have to check also the order of convergence 
by applying academic examples. Another important topic is the solution of 
the resulting systems of linear equations by preconditioned iterative solvers 
including wavelet and panel clustering techniques. The discussion of perfor- 
mance is in general based on the formulation of the boundary value prob- 
lem as a system of boundary integral equations, i.e. the direct or the indi- 
rect approach; the discretization of these systems, either by collocation or 
Galerkin methods, or more general combinations of these. Error estimators, 
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mesh adapting algorithms and /ip— methods are needed to get an optimal 
relation between the amount of work and the required accuracy. Clearly, the 
costs of an estimator should be low. Due to the fast development of parallel 
computer architectures, domain decomposition methods and the paralleliza- 
tion of existing codes are useful to handle coupling problems and problems 
with a large number of degrees of freedom. However, these methods require 
an adequate formulation and appropriate preconditioning techniques. 

The most important criterion for a boundary element realization should 
be the robustness of the code with respect to singular perturbations either 
in geometry or in given boundary data. In general we are interested in the 
numerical amount of work to get a specified accuracy of the approximate 
solution, and in the way how this result is obtained. 

The comparison of existing boundary element codes depends strongly on 
the individual realization, the programming language and on the computer 
architecture used. Hence we have to use a simple program including main 
features of the boundary element methods to scale up all computers used in 
the comparison. 

Note, that the discussion of benchmark problems - not only for boundary 
element methods - is still in progress. Our main goal is not only to find out 
and to demonstrate, which method can solve which class of problems, we also 
want to provide a basis for modern techniques in boundary element methods 
and their application in professional software codes. 



2 A first attempt for defining benchmark problems 

As a simple model problem for the comparison of the numerical integrations 
of the boundary integral operators as well as of the solution procedures for the 
resulting systems of linear equations we consider first the Laplace equation 
in a two-dimensional L-shaped domain with either Dirichlet or Neumann 
boundary conditions. To check the order of convergence we used as an explicit 
solution the fundamental solution with a source point outside the domain. 
The main task is then to solve this boundary value problem by computing the 
remaining Cauchy datum and the solution along a described line inside the 
domain. In general, there are different possibilities for solving this problem 
as described in the introduction. 

The first benchmark problem was chosen as follows: 

Au{x) =0 for a: G = [-0.25, 0.25]2\[-0.25, 0]^ 

Dirichlet problem: 

u{x) = In |x — X* I for x £ F. 

Neumann problem: 

du . (x — x*,n{x)) 



for X € r. 
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The given source point is x* = (-0.01, -0.01)^. 

As a more complicated problem we consider in the domain a mixed 
boundary value problem in two-dimensional linear elasticity with plain strain, 
where the boundary conditions are given according to the following sketch: 




As material parameters we choose for the Young modulus E = 200.000 
and for the Possion ratio z/ = 0.3. In addition to the complete Cauchy data, 
the displacements and the main stresses along G are required. 

As a three-dimensional problem in linear elasticity we consider an L-cube, 
the so-called Fichera corner as sketched in the following figure: 




The length of the edges of the cube are two units, while the length of the 
Fichera edge is one unit. The material parameters E and z/ are the same as 
in the two-dimensional problem. Mixed boundary conditions are given along 




494 H. Schulz, O. Steinbach, W. L. Wendland 
the faces as 

z = 0 : ti = t 2 = ts = 0] 
z = 2 : ti = t 2 = Us = 0] 

x = 0:ti = -1000,^2 =^3 == 0 ; 

y = 2:ti=U2=ts=0] 
y = 0 :ti =t 2 = ts = 0; 

X = 2 : ui = t2 =ts =0 

and along the sides of the Fichera edge: 

Ui =U2 = Us — 0 . 

Again we require the remaining Cauchy data and the displacements and the 
principal stresses along G. 



3 Results 

Here we present some results for the above benchmark problems obtained by 
different groups in the priority research programme. For a detailed description 
of the algorithms used we refer to the corresponding contributions in this 
volume. 

The uniform discretization of the Dirichlet problem for the Laplace equa- 
tion gives for N = 1024 boundary elements an absolute -error of the 
normal derivative of approximately 0.06 for piecewise constant trials. This 
accuracy was obtained by either a Galerkin discretization of Symm’s integral 
equation or with the method of difference potentials by A. Hommel (Chem- 
nitz). To get moderately better convergence results one may use parallel 
computers to realize a domain decomposition with N = 4096 boundary ele- 
ments and obtain an error of 1.41 • 10“^, this result was presented by M. Kuhn 
(Linz). Due to the large number of degrees of freedom, it seems more prefer- 
able to use an adaptive discretization, as by M. Maischak (Hannover). With 
an adaptive /ip-version he got an L^-error of 10“^ by using just 100 bound- 
ary elements. However, such an adaptive discretization requires the use of 
an appropriate preconditioner to limit the number of iterations needed. For 
the /ip-method, the simplest choice is diagonal preconditioning, which brings 
some improvement only in the adaptive case. Then the number of iterations 
for the adaptive version grows only as in the non-preconditioned uniform 
case. For the Dirichlet problem also collocation methods were presented by 
R. Bialecki (Erlangen). Since he does not use piecewise constant trials, he 
has to formulate additional equations at the Dirichlet corners or to use non- 
uniform trial functions. In the first cause, the result depends on the special 
choice of these additional equations. Note, that in the case of a Galerkin 
formulation there are no such problems. 
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The mixed boundary value problem in linear elasticity was also solved 
via different approaches. Collocation methods based on the boundary inte- 
gral equation including only single and double layer potentials were used 
by M. Kreienmeyer (Hannover) for an uniform mesh comparing a sequen- 
tieal with a parallel implementation as well as an adaptive mesh refinement. 
E. Schnack and his crew (Karlsruhe) used a macro boundary element coupled 
with a finite element method where the Neumann-Dirichlet map is realized 
by a Neumann series based on the discrete double layer potential. The pro- 
gram package BEAT was used by G. Kuhn and his group (Erlangen) to 
solve the two-dimensional problem in linear elasticity. Due to the use of only 
one boundary integral equation for collocation one cannot expect a partic- 
ularly efficient solution method. More suitable is the symmetric formulation 
discretized by some Galerkin-Bubnov method. This method was used by 
U. Brink (Hannover) based on the code by M. Maischak (Hannover), as well 
as in the coupled finite-boundary element software package by U. Danger and 
his co-workers (Linz). 

The situation for the three-dimensional problem in linear elasticity is 
quite similar. Here codes based on collocation were used by G. Kuhn (Er- 
langen), L. Gaul (Stuttgart), E. Schnack (Karlsruhe) and R. Quatember 
(Stuttgart). Most of these codes are based on adaptive numerical integra- 
tion. The only Galerkin discretization is due to H. Andra (Karlsruhe), who 
realized the symmetric formulation in three-dimensional linear elasticity. 



4 Conclusions 

The problems and results shown here present a first attempt for defining 
a set of benchmark problems for boundary element methods. Up to now 
we still discuss the choice of benchmarks as well as the presentation of the 
results. In this first set of problems we asked mostly for the solution of mixed 
boundary value problems, where the results are given either in tables or by 
graphics. Hence, still a fair and detailed comparison of the results and of the 
methods used, is not possible. Therefore we startet a second attempt stating 
the problems and the results required more detailed in corresponding tables 
[1]. Everybody is invited to join this forum, to propose benchmark problems 
and to describe specified results to be listed in those tables. 

The main goal hereby should be the comparison of mathematical methods, 
i.e. the numerical amount of work with respect to the accuracy of the solution. 
Moreover, we are also interested in the solution of real application problems. 
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